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CANCER-LINKED GENE AS TARGET FOR 
CHEMOTHERAPY 



This application claims priority of U.S. Provisional Patent Application 
60/386,651 , filed 6 June 2002, the disclosure of which is hereby incorporated 
by reference in its entirety. 

FIELD OF THE INVENTION 

The present invention relates to methods of screening cancer-linked 
genes and expression products for involvement in the cancer initiation and 
facilitation process as a means of cancer diagnosis as well as the use of such 
genes for screening potential anti-cancer agents, including small organic 
compounds and other molecules, and development of therapeutic agents. 



BACKGROUND OF THE INVENTION 

Cancer-linked genes are valuable in that they indicate genetic 
differences between cancer cells and normal cells, such as where a gene is 
expressed in a cancer cell but not in a non-cancer cell, or where said gene is 
over-expressed or expressed at a higher level in a cancer as opposed to 
normal or non-cancer cell. In addition, the expression of such a gene in a 
normal cell but not in a cancer cell, especially of the same type of tissue, can 
indicate important functions in the cancerous process. For example, screening 
assays for novel drugs are based on the response of model cell based 
systems in vitro to treatment with specific compounds. Such genes are also 
useful in the diagnosis of cancer and the identification of a cell as cancerous. 
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Gene activity is readily measured by measuring the rate of production of gene 
products, such as RNAs and polypeptides encoded by such genes. Where 
genes encode cell surface proteins, appearance of, or alterations in, such 
proteins, as cell surface markers, are an indication of neoplastic activity. 
5 Some such screens rely on specific genes, such as oncogenes (or gene 
mutations). In accordance with the present invention, a cancer-linked gene 
has been identified and its putative amino acid sequence worked out. Such 
gene is useful in the diagnosing of cancer, the screening of anticancer agents 
and the treatment of cancer using such agents, especially in that these genes 
1 0 encode polypeptides that can act as markers, such as cell surface markers, 
thereby providing ready targets for anti-tumor agents such as antibodies, 
preferably antibodies complexed to cytotoxic agents, including apoptotic 
agents. 

15 

BRIEF SUMMARY OF THE INVENTION 

In accordance with the present invention, there is provided herein a 
20 cancer specific gene, linked especially to prostate cancer, or otherwise 
involved in the cancer initiating and facilitating process and the derived amino 
acid sequence thereof, including a number of different transcripts derived from 
said gene. 

25 In one aspect, the present invention relates to a process for identifying 

an agent that modulates the activity of a cancer-related gene comprising: 

(a) contacting a compound with a cell containing a gene that 
corresponds to a polynucleotide having a sequence selected from the group 
consisting of SEQ ID NO: 1-18 and under conditions promoting the 

30 expression of said gene; and 

(b) detecting a difference in expression of said gene relative to when 
said compound is not present 
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thereby identifying an agent that modulates the activity of a cancer- 
related gene. 

In various embodiments of such a process, the cell is a cancer cell and 
5 the difference in expression is a decrease in expression. Such 
polynucleotides may also include those that have sequences identical to SEQ 
ID NO: 1 -18. 

In another aspect, the present invention relates to a process for 
10 identifying an anti-neoplastic agent comprising contacting a cell exhibiting 
neoplastic activity with a compound first identified as a cancer related gene 
modulator using an assay process disclosed herein and detecting a decrease 
in said neoplastic activity after said contacting compared to when said 
contacting does not occur. Such neoplastic activity may include accelerated 
1 5 cellular replication and/or metastasis, and the decrease in neoplastic activity 
preferably results from the death of the cell, or senescence, terminal 
differentiation or growth inhibition. 

The present invention also relates to a process for identifying an anti- 
20 neoplastic agent comprising administering to an animal exhibiting a cancer 
condition an effective amount of an agent first identified according to a 
process of one of one of the assays disclosed according to the invention and 
detecting a decrease in said cancerous condition. 

25 The present invention further relates to a process for determining the 

cancerous status of a cell, comprising determining an increase in the level of 
expression in said cell of at least one gene that corresponds to a 
polynucleotide having a sequence selected from the group consisting of SEQ 
ID NO: 1-18 wherein an elevated expression relative to a known non- 
30 cancerous cell indicates a cancerous state or potentially cancerous state. 
Such elevated expression may be due to an increased copy number. 
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The present invention additionally relates to an isolated polypeptide, 
encoded by one of the polynucleotide transcripts disclosed herein, comprising 
an amino acid sequence homologous to an amino acid sequence selected 
from the group consisting of SEQ ID NO: 19-26 wherein any difference 
5 between said amino acid sequence and the sequence of SEQ ID NO: 19-26 
is due solely to conservative amino acid substitutions and wherein said 
isolated polypeptide comprises at least one immunogenic fragment. In a 
preferred embodiment, the present invention encompasses an isolated 
polypeptide comprising an amino acid sequence homologous to an amino 
1 0 acid sequence selected from the group consisting of SEQ ID NO: 19-26. 

The present invention also relates to an antibody that reacts with a 
polypeptide as disclosed herein, preferably a polypeptide comprising an 
amino acid sequence selected from the group consisting of SEQ ID NO: 19 - 
15 26. Such an antibody may be polyclonal, monoclonal, recombinant or 
synthetic in origin. 

In one such embodiment, said antibody is associated, either 
covalently or non-covalently, with a cytotoxic agent, for example, an 
20 apoptotic agent. Thus, the present invention relates to an 
immunoconjugate comprising an antibody of the invention and a cytotoxic 
agent. 

The present invention also relates to a process for treating cancer 
25 comprising contacting a cancerous cell with an agent having activity against 
an expression product encoded by a gene sequence selected from the group 
consisting of SEQ ID NO: 1 - 18. In one such embodiment, the cancerous cell 
is contacted in vivo. In another such embodiment, said agent has affinity for 
said expression product. In a preferred embodiment, such agent is an 
30 antibody disclosed herein, such as an antibody that is specific or selective for, 
or otherwise reacts with, a polypeptide of the invention. In a preferred 
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embodiment, the expression product is a polypeptide incorporating an amino 
acid sequence selected from SEQ ID NO: 19 - 26. 

The present invention further encompasses an immunogenic 
5 composition comprising a polypeptide disclosed herein, as well as 
compositions formed using antibodies specific for these polypeptides. 

The present invention is also directed to uses of such compositions. 
Such uses include a method for treating cancer in an animal afflicted 

10 therewith comprising administering to said animal an amount of an 
immunogenic composition of one or more of the polypeptides disclosed herein 
where such amount is an amount sufficient to elicit the production of cytotoxic 
T lymphocytes specific for a polypeptide of the invention, preferably a 
polypeptide incorporating a sequence of SEQ ID NO: 19-26. In a preferred 

1 5 embodiment, the animal to be so treated is a human patient. 



DEFINITIONS 

20 As used herein, the terms "portion," "segment," and "fragment," when 

used in relation to polypeptides, refer to a continuous sequence of residues, 
such as amino acid residues, which sequence forms a subset of a larger 
sequence. For example, if a polypeptide were subjected to treatment with any of 
the common endopeptidases, such as trypsin or chymotrypsin, the oligopeptides 

25 resulting from such treatment would represent portions, segments or fragments 
of the starting polypeptide. When used in relation to a polynucleotides, such 
terms refer to the products produced by treatment of said polynucleotides with 
any of the common endonucleases. 

30 As used herein, the term "isolated" means that the material is removed 

from its original environment (e.g., the natural environment if it is naturally 
occurring). It could also be produced recombinantly and subsequently purified. 

5 
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For example, a naturally-occurring polynucleotide or polypeptide present in a 
living animal is not isolated, but the same polynucleotide or polypeptide, 
separated from some or all of the coexisting materials in the natural system, is 
isolated. Such polynucleotides, for example, those prepared recombinantly, 
5 could be part of a vector and/or such polynucleotides or polypeptides could be 
part of a composition, and still be isolated in that such vector or composition is 
not part of its natural environment. In one embodiment of the present invention, 
such isolated, or purified, polypeptide is useful in generating antibodies for 
practicing the invention, or where said antibody is attached to a cytotoxic or 
1 0 cytolytic agent, such as an apoptotic agent. 

The term "percent identity" or "percent identical," when referring to a 
sequence, means that a sequence is compared to a claimed or described 
sequence after alignment of the sequence to be compared (the "Compared 
1 5 Sequence") with the described or claimed sequence (the "Reference 
Sequence"). The Percent Identity is then determined according to the following 
formula: 

Percent Identity = 100[1-(C/R)] 

20 

wherein C is the number of differences between the Reference Sequence and 
the Compared Sequence over the length of alignment between the Reference 
Sequence and the Compared Sequence wherein (i) each base or amino acid in 
the Reference Sequence that does not have a corresponding aligned base or 

25 amino acid in the Compared Sequence and (ii) each gap in the Reference 
Sequence and (iii) each aligned base or amino acid in the Reference Sequence 
that is different from an aligned base or amino acid in the Compared Sequence, 
constitutes a difference; and R is the number of bases or amino acids in the 
Reference Sequence over the length of the alignment with the Compared 

30 Sequence with any gap created in the Reference Sequence also being counted 
as a base or amino acid. 



6 



WO 03/104404 



PCT/US03/17772 



If an alignment exists between the Compared Sequence and the 
Reference Sequence for which the percent identity as calculated above is about 
equal to or greater than a specified minimum Percent Identity then the 
Compared Sequence has the specified minimum percent identity to the 
5 Reference Sequence even though alignments may exist in which the 
hereinabove calculated Percent Identity is less than the specified Percent 
Identity. 

As known in the art "similarity" between two polypeptides is determined 
10 by comparing the amino acid sequence and its conserved amino acid 
substitutes of one polypeptide to the sequence of a second polypeptide. 

In accordance with the present invention, the term "DNA segment" or 
"DNA sequence" refers to a DNA polymer, in the form of a separate fragment 

15 or as a component of a larger DNA construct, which has been derived from 
DNA isolated at least once in substantially pure form, i.e., free of 
contaminating endogenous materials and in a quantity or concentration 
enabling identification, manipulation, and recovery of the segment and its 
component nucleotide sequences by standard biochemical methods, for 

20 example, using a cloning vector. Such segments are provided in the form of 
an open reading frame uninterrupted by internal nontranslated sequences, or 
introns, which are typically present in eukaryotic genes. Sequences of non- 
translated DNA may be present downstream from the open reading frame, 
where the same do not interfere with manipulation or expression of the coding 

25 regions. 

The term "coding region" refers to that portion of a gene which either 
naturally or normally codes for the expression product of that gene in its 
natural genomic environment, i.e., the region coding in vivo for the native 
30 expression product of the gene. The coding region can be from a normal, 
mutated or altered gene, or can even be from a DNA sequence, or gene, 
wholly synthesized in the laboratory using methods well known to those of 
skill in the art of DNA synthesis. 
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In accordance with the present invention, the term "nucleotide 
sequence" refers to a heteropolymer of deoxyribonucleotides. Generally, DNA 
segments encoding the proteins provided by this invention are assembled 
5 from cDNA fragments and short oligonucleotide linkers, or from a series of 
oligonucleotides, to provide a synthetic gene which is capable of being 
expressed in a recombinant transcriptional unit comprising regulatory 
elements derived from a microbial, eukaryotic or viral operon. 

1 0 The term "expression product" means that polypeptide or protein that is 

the natural translation product of the gene and any nucleic acid sequence 
coding equivalents resulting from genetic code degeneracy and thus coding 
for the same amino acid(s). 

1 5 The term "active fragment," when referring to a coding sequence, means 

a portion comprising less than the complete coding region whose expression 
product retains essentially the same biological function or activity as the 
expression product of the complete coding region. 

20 The term "primer" means a short nucleic acid sequence that is paired 

with one strand of DNA and provides a free 3'-OH end at which a DNA 
polymerase starts synthesis of a deoxyribonucleotide chain. 

The term "promoter" means a region of DNA involved in binding of RNA 
25 polymerase to initiate transcription. The term "enhancer" refers to a region of 
DNA that, when present and active, has the effect of increasing expression of 
a different DNA sequence that is being expressed, thereby increasing the 
amount of expression product formed from said different DNA sequence. 

30 The term "open reading frame (ORF)" means a series of triplets coding 

for amino acids without any termination codons and is a sequence 
(potentially) translatable into protein. 
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As used herein, reference to a "DNA sequence" includes both single 
stranded and double stranded DNA. Thus, the specific sequence, unless the 
context indicates otherwise, refers to the single strand DNA of such 
5 sequence, the duplex of such sequence with its complement (double stranded 
DNA) and the complement of such sequence. 

As used herein, "corresponding genes" refers to genes that encode an 
RNA that is at least 90% identical, preferably at least 95% identical, most 

1 0 preferably at least 98% identical, and especially identical, to an RNA encoded 
by one of the nucleotide sequences disclosed herein (i.e., SEQ ID NO: 1 - 
18). Such genes will also encode the same polypeptide sequence as any of 
the sequences disclosed herein, preferably SEQ ID NO: 1-18, but may 
include differences in such amino acid sequences where such differences are 

15 limited to conservative amino acid substitutions, such as where the same 
overall three dimensional structure, and thus the same antigenic character, is 
maintained. Thus, amino acid sequences may be within the scope of the 
present invention where they react with the same antibodies that react with 
polypeptides comprising the sequences of SEQ ID NO: 19 - 26. A 

20 "corresponding gene" includes splice variants thereof. 

The genes identified by the present disclosure are considered "cancer- 
related" genes, as this term is used herein, and include genes expressed at 
higher levels (due, for example, to elevated rates of expression, elevated 

25 extent of expression or increased copy number) in cancer cells relative to 
expression of these genes in normal (i.e., non-cancerous) cells where said 
cancerous state or status of test cells or tissues has been determined by 
methods known in the art, such as by reverse transcriptase polymerase chain 
reaction (RT-PCR) as described in the Examples herein. In specific 

30 embodiments, this relates to the genes whose sequences correspond to the 
sequences of SEQ ID NO: 1-18. 
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As used herein, the term "conservative amino acid substitutions" are 
defined herein as exchanges within one of the following five groups: 
L Small aliphatic, nonpolar or slightly polar residues: 
Ala, Ser, Thr, Pro, Gly; 
5 II. Polar, negatively charged residues and their amides: 

Asp, Asn, Glu, Gin; 

III. Polar, positively charged residues: 

His, Arg, Lys; 

IV. Large, aliphatic, nonpolar residues: 
10 Met Leu, lie, Val, Cys 

V. Large, aromatic residues: 

Phe, Tyr, Trp 



DETAILED SUMMARY OF THE INVENTION 

The present invention relates to processes for utilizing a nucleotide 
20 sequence for a cancer-linked gene, polypeptides encoded by such sequences 
and antibodies reactive with such polypeptides in methods of treating and 
diagnosing cancer, preferably prostate cancer, and in carrying out screening 
assays for agents effective in reducing the activity of cancer-linked genes and 
thereby treating a cancerous condition. 

25 

The proteins useful in the invention are anion transport proteins and 
are specific for prostate tissue as well as being specific for prostate cancer. 
Thus, the presence of such antigens is diagnostic for prostate cancer. 

30 The polypeptides disclosed herein incorporate various polynucleotide 

transcripts (SEQ ID NO: 1-18) and the derived amino acid sequence (SEQ 
ID NO: 19-26) from said transcripts are available as targets for 
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chemotherapeutic agents, especially anti-cancer agents, including antibodies 
specific for said polypeptides. 

The cancer-related polynucleotide sequences disclosed herein 
5 correspond to gene sequences whose expression is indicative of the 
cancerous status of a given cell. Such sequences are substantially identical to 
SEQ ID NO: 1-18, which represent different transcripts identified from the 
GenBank EST database and which exhibit cancer-specific expression. The 
polynucleotides of the invention are those that correspond to a sequence of 
1 0 SEQ ID NO: 1-18. Such sequences have been searched within the GenBank 
database, especially the EST database, with the following results: 



15 



25 



Type: 



Tissue: 



AffyFragment-ID(s): 
20 Accession(s): 

Unigene cluster-ID(s): 



Chromosomal location: 



Cell-surface tumor antigen 
therapeutic antibody target 

prostate 

104504 

AF0712O2 

Hs.1 39336 

ATP-binding cassette, 
(CFTR/MRP), member 4 

13 



sub-family C 



The nucleotides and polypeptides, as gene products, used in the 
30 processes of the present invention may comprise a recombinant polynucleotide 
or polypeptide, a natural polynucleotide or polypeptide, or a synthetic 
polynucleotide or polypeptide, or a recombinant polynucleotide or polypeptide. 

Fragments of such polynucleotides and polypeptides as are disclosed 
35 herein may also be useful in practicing the processes of the present invention. 
For example, a fragment, derivative or analog of the polypeptide (SEQ ID NO: 
ig - 26) may be (i) one in which one or more of the amino acid residues are 
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substituted with a conserved or non-conserved amino acid residue (preferably a 
conserved amino acid residue) and such substituted amino acid residue may or 
may not be one encoded by the genetic code, or (ii) one in which one or more of 
the amino acid residues includes a substituent group, or (iii) one in which the 
5 mature polypeptide is fused with another compound, such as a compound to 
increase the half-life of the polypeptide (for example, polyethylene glycol), or (iv) 
one in which the additional amino acids are fused to the mature polypeptide, 
such as a leader or secretory sequence or a sequence which is employed for 
purification of the mature polypeptide (such as a histidine hexapeptide) or a 
1 0 proprotein sequence. Such fragments, derivatives and analogs are deemed to 
be within the scope of those skilled in the art from the teachings herein. 

In another aspect, the present invention relates to an isolated 
polypeptide, including a purified polypeptide, comprising an amino acid 

1 5 sequence at least 90% identical to the amino acid sequence of SEQ ID NO: 19 
- 26. In preferred embodiments, said isolated polypeptide comprises an amino 
acid sequence having sequence identity of at least 95%, preferably at least 
about 98%, and especially is identical to, the sequence of SEQ ID NO: 19-26. 
The present invention also includes isolated active fragments of such 

20 polypeptides where said fragments retain the biological activity of the 
polypeptide or where such active fragments are useful as specific targets for 
cancer treatment, prevention or diagnosis. Thus, the present invention relates to 
any polypeptides, or fragments thereof, with sufficient sequence homology to 
the sequences disclosed herein as to be useful in the production of antibodies 

25 that react with (i.e., are selective or specific for) the polypeptides of SEQ ID NO: 
19 - 26 so as to be useful in targeting cells that exhibit such polypeptides, or 
fragments, on their surfaces, thereby providing targets for such antibodies and 
therapeutic agents associated with such antibodies. 

30 The polynucleotides and polypeptides useful in practicing the processes 

of the present invention may likewise be obtained in an isolated or purified form. 
In addition, the polypeptide disclosed herein as being useful in practicing the 
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processes of the invention are believed to be surface proteins present on cells, 
such as cancerous cells. Precisely how such cancer-linked proteins are used in 
the processes of the invention may thus differ depending on the therapeutic 
approach used. For example, cell-surface proteins, such as receptors, are 
5 desirable targets for cytotoxic antibodies that can be generated against the 
polypeptides disclosed herein. 

The sequence information disclosed herein, as derived from the 
GenBank submissions, can readily be utilized by those skilled in the art to 
10 prepare the corresponding full-length polypeptide by peptide synthesis. The 
same is true for either the polynucleotides or polypeptides disclosed herein for 
use in the methods of the invention. 



The present invention relates to an isolated polypeptide, encoded by 
1 5 one of the polynucleotide transcripts disclosed herein, comprising an amino 
acid sequence homologous to an amino acid sequence selected from the 
group consisting of SEQ ID NO: 19-26, wherein any difference between 
amino acid sequence in the isolated polypeptide and the sequence of SEQ ID 
NO: 19 - 26 is due solely to conservative amino acid substitutions and 
20 wherein said isolated polypeptide comprises at least one immunogenic 
fragment. In a preferred embodiment, the present invention encompasses an 
isolated polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 19 - 26. 

25 Methods of producing recombinant cells and vectors useful in 

preparing the polynucleotides and polypeptides disclosed herein are well 
known to those skilled in the molecular biology art. See, for example, 
Sambrook, et al., Molecular Cloning: A Laboratory Manual, Second Edition, Cold 
Spring Harbor, N.Y., (1989), Wu et al., Methods in Gene Biotechnology (CRC 

30 Press, New York, NY, 1997), and Recombinant Gene Expression Protocols, 
in Methods in Molecular Biology, Vol. 62, (Tuan, ed., Humana Press, Totowa, 
NJ, 1997), the disclosures of which are hereby incorporated by reference. 
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In one aspect, the present invention relates to a process for identifying 
an agent that modulates the activity of a cancer-related gene comprising: 

(a) contacting a compound with a cell containing a gene that 
corresponds to a polynucleotide having a sequence selected from the group 

5 consisting of SEQ ID NO: 1-18 and under conditions promoting the 
' expression of said gene; and 

(b) detecting a difference in expression of said gene relative to when 
said compound is not present 

thereby identifying an agent that modulates the activity of a cancer- 
1 0 related gene. 

In specific embodiments of such process the cell is a cancer cell and 
the difference in expression is a decrease in expression. Such 
polynucleotides may also include those that have sequences identical to SEQ 
15 ID NO: 1 -18. 

In another aspect, the present invention relates to a process for 
identifying an antineoplastic agent comprising contacting a cell exhibiting 
neoplastic activity with a compound first identified as a cancer related gene 
20 modulator using an assay process disclosed herein and detecting a decrease 
in said neoplastic activity after said contacting compared to when said 
contacting does not occur. Such neoplastic activity may include accelerated 
cellular replication and/or metastasis, and the decrease in neoplastic activity 
preferably results from the death of the cell. 

25 

The present invention also relates to a process for identifying an anti- 
neoplastic agent comprising administering to an animal exhibiting a cancer 
condition an effective amount of an agent first identified according to a 
process of one of one of the assays disclosed according to the invention and 
30 detecting a decrease in said cancerous condition. 
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In specific embodiments of the present invention, the genes useful for 
the invention comprise genes that correspond to polynucleotides having a 
sequence selected from SEQ ID NO: 1 - 18, or may comprise the sequence of 
any of the polynucleotides disclosed herein (where the latter are cDNA 
5 sequences). 

In accordance with the present invention, such assays rely on methods 
of determining the activity of the gene in question. Such assays are 
advantageously based on model cellular systems using cancer cell lines, 

10 primary cancer cells, or cancerous tissue samples that are maintained in 
growth medium and treated with compounds at a single concentration or at a 
range of concentrations. At specific times after treatment, cellular RNAs are 
conveniently isolated from the treated cells or tissues, which RNAs are 
indicative of expression of selected genes. The cellular RNA is then divided 

1 5 and subjected to differential analysis that detects the presence and/or quantity 
of specific RNA transcripts, which transcripts may then be amplified for 
detection purposes using standard methodologies, such as, for example, 
reverse transcriptase polymerase chain reaction (RT-PCR), etc. The presence 
or absence, or concentration levels, of specific RNA transcripts are 

20 determined from these measurements. The polynucleotide sequences 
disclosed herein are readily used as probes for the detection of such RNA 
transcripts and thus the measurement of gene activity and expression. 

The polynucleotides of the invention can include fully operational genes 
25 with attendant control or regulatory sequences or merely a polynucleotide 
sequence encoding the corresponding polypeptide or an active fragment or 
analog thereof. 

Because expression of the polynucleotide sequences disclosed herein 
30 are specific to the cancerous state, useful gene modulation is downward 
modulation, so that, as a result of exposure to an antineoplastic agent 
identified by the screening assays herein, the corresponding gene of the 
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cancerous cell is expressed at a lower level (or not expressed at all) when 
exposed to the agent as compared to the expression when not exposed to the 
agent. For example, the gene sequences disclosed herein (SEQ ID NO: 1 - 
18) correspond to a gene expressed at a higher level in cells of prostate 
5 cancer than in normal prostate cells. Thus, where said chemical agent causes 
this gene of the tested cell to be expressed at a lower level than the same 
genes of the reference, this is indicative of downward modulation and 
indicates that the chemical agent to be tested has antineoplastic activity. 

1 0 In carrying out the assays disclosed herein, relative antineoplastic activity 

may be ascertained by the extent to which a given chemical agent modulates 
the expression of genes present in a cancerous cell. Thus, a first chemical agent 
that modulates the expression of a gene associated with the cancerous state 
(i.e., a gene corresponding to one or more of the polynucleotide transcripts 

1 5 disclosed herein) to a larger degree than a second chemical agent tested by the 
assays of the invention is thereby deemed to have higher, or more desirable, or 
more advantageous, anti-neoplastic activity than said second chemical agent. 

The gene expression to be measured is commonly assayed using RNA 
20 expression as an indicator. Thus, the greater the level of RNA (for example, 
messenger RNA or mRNA) detected the higher the level of expression of the 
corresponding gene. Thus, gene expression, either absolute or relative, is 
determined by the relative expression of the RNAs encoded by such genes. 

25 RNA may be isolated from samples in a variety of ways, including lysis 

and denaturation with a phenolic solution containing a chaotropic agent (e.g., 
trizol) followed by isopropanol precipitation, ethanol wash, and resuspension in 
aqueous solution; or lysis and denaturation followed by isolation on solid 
support, such as a Qiagen resin and reconstitution in aqueous solution; or lysis 

30 and denaturation in non-phenolic, aqueous solutions followed by enzymatic 
conversion of RNA to DNA template copies. 
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Normally, prior to applying the processes of the invention, steady state 
RNA expression levels for the genes, and sets of genes, disclosed herein will 
have been obtained. It is the steady state level of such expression that is 
affected by potential anti-neoplastic agents as determined herein. Such steady 
5 state levels of expression are easily determined by any methods that are 
sensitive, specific and accurate. Such methods include, but are in no way limited 
to, real time quantitative polymerase chain reaction (PCR), for example, using a 
Perkin-Elmer 7700 sequence detection system with gene specific primer probe 
combinations as designed using any of several commercially available software 
1 0 packages, such as Primer Express software., solid support based hybridization 
array technology using appropriate internal controls for quantitation, including 
filter, bead, or microchip based arrays, solid support based hybridization arrays 
using, for example, chemiluminescent, fluorescent, or electrochemical reaction 
based detection systems. 

15 

The gene expression indicative of a cancerous state need not be 
characteristic of every cell of a given tissue. Thus, the methods disclosed 
herein are useful for detecting the presence of a cancerous condition within a 
tissue where less than all cells exhibit the complete pattern. Thus, for 

20 example, a selected gene corresponding to the sequence of SEQ ID NO: 1, 
may be found, using appropriate probes, either DNA or RNA, to be present in 
as little as 60% of cells derived from a sample of tumorous, or malignant, 
tissue. In a highly preferred embodiment, such gene pattern is found to be 
present in at least 100% of cells drawn from a cancerous tissue and absent 

25 from at least 100% of a corresponding normal, non-cancerous, tissue sample. 

Expression of a gene may be related to copy number, and changes in 
expression may be measured by determining copy number. Such change in 
gene copy number may be determined by determining a change in expression 
30 of messenger RNA encoded by a particular gene sequence, especially that of 
SEQ ID NO: 1-18. Also in accordance with the present invention, said gene 
may be a cancer initiating or facilitating gene. In carrying out the methods of 
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the present invention, a cancer facilitating gene is a gene that, while not 
directly initiating tumor formation or growth, acts, such as through the actions 
of its expression product, to direct, enhance, or otherwise facilitate the 
progress of the cancerous condition, including where such gene acts against 
5 genes, or gene expression products, that would otherwise have the effect of 
decreasing tumor formation and/or growth. 

Although the expression of a gene corresponding to a sequence of 
SEQ ID NO: 1-18 may be indicative of a cancerous status for a given cell, 

1 0 the mere presence of such a gene may not alone be sufficient to achieve a 
malignant condition and thus the level of expression of such gene may also 
be a significant factor in determining the attainment of a cancerous state. 
Thus, it becomes essential to also determine the level of expression of a gene 
as disclosed herein, including substantially similar sequences, as a separate 

1 5 means of diagnosing the presence of a cancerous status for a given cell, 
groups of cells, or tissues, either in culture or in situ. 

The level of expression of the polypeptides disclosed herein is also a 
measure of gene expression, such as polypeptides having sequence identical, 
20 or similar to, any polypeptide encoded by a sequence of SEQ ID NO: 1-18, 
especially a polypeptide whose amino acid sequence is the sequence of SEQ 
ID NO: 19-26. 

In accordance with the foregoing, the present invention specifically 
25 contemplates a method for determining the cancerous status of a cell to be 
tested, comprising determining the level of expression in said cell of a gene 
that includes one of the nucleotide sequences selected from the sequences of 
SEQ ID NO: 1-18, including sequences substantially identical to said 
sequences, or characteristic fragments thereof, or the complements of any of 
30 the foregoing and then comparing said expression to that of a cell known to 
be non-cancerous whereby the difference in said expression indicates that 
said cell to be tested is cancerous. 
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In accordance with the invention, although gene expression for a gene 
that includes as a portion thereof one of the sequences of SEQ ID NO: 1-18, 
is preferably determined by use of a probe that is a fragment of such 
5 nucleotide sequence, it is to be understood that the probe may be formed 
from a different portion of the gene. Expression of the gene may be 
determined by use of a nucleotide probe that hybridizes to messenger RNA 
(mRNA) transcribed from a portion of the gene other than the specific 
nucleotide sequence disclosed herein. 

10 

It should be noted that there are a variety of different contexts in which 
genes have been evaluated as being involved in the cancerous process. 
Thus, some genes may be oncogenes and encode proteins that are directly 
involved in the cancerous process and thereby promote the occurrence of 

15 cancer in an animal. In addition, other genes may serve to suppress the 
cancerous state in a given cell or cell type and thereby work against a 
cancerous condition forming in an animal. Other genes may simply be 
involved either directly or indirectly in the cancerous process or condition and 
may serve in an ancillary capacity with respect to the cancerous state. All 

20 such types of genes are deemed with those to be determined in accordance 
with the invention as disclosed herein. Thus, the gene determined by said 
process of the invention may be an oncogene, or the gene determined by said 
process may be a cancer facilitating gene, the latter including a gene that 
directly or indirectly affects the cancerous process, either in the promotion of a 

25 cancerous condition or in facilitating the progress of cancerous growth or 
otherwise modulating the growth of cancer cells, either in vivo or ex vivo. In 
' addition, the gene determined by said process may be a cancer suppressor 
gene, which gene works either directly or indirectly to suppress the initiation or 
progress of a cancerous condition. Such genes may work indirectly where 

30 their expression alters the activity of some other gene or gene expression 
product that is itself directly involved in initiating or facilitating the progress of 
a cancerous condition. For example, a gene that encodes a polypeptide, 
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either wild or mutant in type, which polypeptide acts to suppress of tumor 
suppressor gene, or its expression product, will thereby act indirectly to 
promote tumor growth. 

5 As noted previously, polynucleotides encoding the same proteins as 

any of SEQ ID NO: 1-18, regardless of the percent identity of such 
sequences, are also specifically contemplated by any of the methods of the 
present invention that rely on any or all of said sequences, regardless of how 
they are otherwise described or limited. Thus, any such sequences are 
1 0 available for use in carrying out any of the methods disclosed according to the 
invention. Such sequences also include any open reading frames, as defined 
herein, present within the sequence of SEQ ID NO: 1 - 18. 



Because a gene disclosed according to the invention "corresponds to" 

15 a polynucleotide having a sequence of SEQ ID NO: 1-18, said gene encodes 
an RNA (processed or unprocessed, including naturally occurring splice 
variants and alleles) that is at least 90% identical, preferably at least 95% 
identical, most preferably at least 98% identical to, and especially identical to, 
an RNA that would be encoded by, or be complementary to, such as by 

20 hybridization with, a polynucleotide having the indicated sequence. In 
addition, genes including sequences at least 90% identical to a sequence 
selected from SEQ ID NO: 1 - 18, preferably at least about 95% identical to 
such a sequence, more preferably at least about 98% identical to such 
sequence and most preferably comprising such sequence are specifically 

25 contemplated by all of the processes of the present invention. Sequences 
encoding the same proteins as any of these sequences, regardless of the 
percent identity of such sequences, are also specifically contemplated by any 
of the methods of the present invention that rely on any or all of said 
sequences, regardless of how they are otherwise described or limited. The 

30 polynucleotide sequences of the invention also include any open reading 
frames, as defined herein, present within any of the sequences of SEQ ID 
NO: 1 - 18. 
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The sequences disclosed herein may be genomic in nature and thus 
represent the sequence of an actual gene, such as a human gene, or may be 
a cDNA sequence derived from a messenger RNA (mRNA) and thus 
5 represent contiguous exonic sequences derived from a corresponding 
genomic sequence, or they may be wholly synthetic in origin for purposes of 
practicing the processes of the invention. Because of the processing that may 
take place in transforming the initial RNA transcript into the final mRNA, the 
sequences disclosed herein may represent less than the full genomic 

1 0 sequence. They may also represent sequences derived from ribosomal and 
transfer RNAs. Consequently, the gene as present in the cell (and 
representing the genomic sequence) and the polynucleotide transcripts 
disclosed herein, including cDNA sequences, may be identical or may be 
such that the cDNAs contain less than the full genomic sequence. Such 

1 5 genes and cDNA sequences are still considered "corresponding sequences" 
(as defined elsewhere herein) because they both encode the same or related 
RNA sequences (i.e., related in the sense of being splice variants or RNAs at 
different stages of processing). Thus, by way of non-limiting example only, a 
gene that encodes an RNA transcript, which is then processed into a shorter 

20 mRNA, is deemed to encode both such RNAs and therefore encodes an RNA 
complementary to (using the usual Watson-Crick complementarity rules), or 
that would otherwise be encoded by, a cDNA (for example, a sequence as 
disclosed herein). Thus, the sequences disclosed herein correspond to genes 
contained in the cancerous cells (here, prostate cancer) and are used to 

25 determine gene activity or expression because they represent the same 
sequence or are complementary to RNAs encoded by the gene. Such a gene 
also includes different alleles and splice variants that may occur in the cells 
used in the methods of the invention, such as where recombinant cells are 
used to assay for anti-neoplastic agents and such cells have been engineered 

30 to express a polynucleotide as disclosed herein, including cells that have 
been engineered to express such polynucleotides at a higher level than is 
found in non-engineered cancerous cells or where such recombinant cells 
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express such polynucleotides only after having been engineered to do so. 
Such engineering includes genetic engineering, such as where one or more of 
the polynucleotides disclosed herein has been inserted into the genome of 
such cell or is present in a vector. 

5 

Such cells, especially mammalian cells, may also be engineered to 
express on their surfaces one or more of the polypeptides of the invention for 
testing with antibodies or other agents capable of masking such polypeptides 
and thereby removing the cancerous nature of the cell. Such engineering 
10 includes both genetic engineering, where the genetic complement of the cells 
is engineered to express the polypeptide, as well as non-genetic engineering, 
whereby the cell has been physically manipulated to incorporate a polypeptide 
of the invention in its plasma membrane, such as by direct insertion using 
chemical and/or other agents to achieve this result. 

15 

In accordance with the foregoing, the present invention includes anti- 
cancer agents that are themselves either polypeptides, or small chemical 
entities, that affect the cancerous process, including initiation, suppression or 
facilitation of tumor growth, either in vivo or ex vivo. Said cancer modulating 
20 agent will have the effect of decreasing gene expression. 

The present invention thus also relates to a method for treating cancer 
comprising contacting a cancerous cell with an agent having activity against 
an expression product encoded by a gene or polynucleotide sequence as 

25 disclosed herein, such as one having, or corresponding to, the nucleotide 
sequence of SEQ ID NO: 1-18. The present invention also relates to a 
process for treating cancer comprising contacting a cancerous cell with an 
agent having activity against an expression product encoded by a gene or 
polynucleotide sequence corresponding to a sequence selected from the 

30 group consisting of SEQ ID NO: 1 - 18. In one such embodiment, the 
cancerous cell is contacted in vivo. In another such embodiment, said agent 
has affinity for said expression product. In a preferred embodiment, such 
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agent is an antibody disclosed herein, such as an antibody that is specific or 
selective for, or otherwise reacts with, a polypeptide of the invention. In a 
preferred embodiment, the expression product is a polypeptide incorporating 
an amino acid sequence selected from SEQ ID NO: 19-26. 

5 

The present invention is also directed to such uses of the compositions 
of polypeptides and antibodies disclosed herein. Such uses include a process 
for treating cancer in an animal afflicted therewith comprising administering to 
said animal an amount of an immunogenic composition of one or more of the 
1 0 polypeptides disclosed herein where such amount if an amount sufficient to 
elicit the production of cytotoxic T lymphocytes specific for a polypeptide of 
the invention, preferably a polypeptide incorporating a sequence of SEQ ID 
NO: 19-26. In a preferred embodiment, the animal to be so treated is a 
human patient. 

15 

The proteins encoded by the genes disclosed herein due to their 
expression, or elevated expression, in cancer cells, represent highly useful 
therapeutic targets for "targeted therapies" utilizing such affinity structures as, 
for example, antibodies coupled to some cytotoxic agent. In such 

20 methodology, it is advantageous that nothing need be known about the 
endogenous ligands or binding partners for such cell surface molecules. 
Rather, an antibody or equivalent molecule that can specifically recognize the 
cell surface molecule (which could include an artificial peptide, a surrogate 
ligand, and the like) that is coupled to some agent that can induce cell death 

25 or a block in cell cycling offers therapeutic promise against these proteins. 
Thus, such approaches include the use of so-called suicide "bullets" against 
intracellular proteins. For example, monoclonal antibodies may readily by 
produced by methods well known in the art, for example, the method of Kohler 
and Milstein (see: Nature, 256:495 (1975). 

30 

With the advent of methods of molecular biology and recombinant 
technology, it is now possible to produce antibody molecules by recombinant 
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means and thereby generate gene sequences that code for specific amino 
acid sequences found in the polypeptide structure of the antibodies. Such 
antibodies can be produced by either cloning the gene sequences encoding 
the polypeptide chains of said antibodies or by direct synthesis of said 
5 polypeptide chains, with in vitro assembly of the synthesized chains to form 
active tetrameric (H 2 L 2 ) structures with affinity for specific epitopes and 
antigenic determinants. This has permitted the ready production of antibodies 
having sequences characteristic of neutralizing antibodies from different 
species and sources. 

10 Regardless of the source of the antibodies, or how they are 

recombinant^ constructed, or how they are synthesized, in vitro or in vivo, 
using transgenic animals, such as cows, goats and sheep, using large cell 
cultures of laboratory or commercial size, in bioreactors or by direct chemical 
synthesis employing no living organisms at any stage of the process, all 

1 5 antibodies have a similar overall 3 dimensional structure. This structure is 
often given as H 2 L 2 and refers to the fact that antibodies commonly comprise 
2 light (L) amino acid chains and 2 heavy (H) amino acid chains. Both chains 
have regions capable of interacting with a structurally complementary 
antigenic target. The regions interacting with the target are referred to as 

20 'Variable" or "V" regions and are characterized by differences in amino acid 
sequence from antibodies of different antigenic specificity. 

The variable regions of either H or L chains contains the amino acid 
sequences capable of specifically binding to antigenic targets. Within these 
25 sequences are smaller sequences dubbed "hypervariable" because of their 
extreme variability between antibodies of differing specificity. Such 
hypervariable regions are also referred to as "complementarity determining 
regions" or "CDR" regions. These CDR regions account for the basic 
specificity of the antibody for a particular antigenic determinant structure. 

30 

The CDRs represent non-contiguous stretches of amino acids within 
the variable regions but, regardless of species, the positional locations of 
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these critical amino acid sequences within the variable heavy and light chain 
regions have been found to have similar locations within the amino acid 
sequences of the variable chains. The variable heavy and light chains of all 
antibodies each have 3 CDR regions, each non-contiguous with the others 
5 (termed L1, L2, L3, H1, H2, H3) for the respective light (L) and heavy (H) 
chains. The accepted CDR regions have been described by Kabat et al., J. 
Biol. Chem. 252:6609-6616 (1977). 

In all mammalian species, antibody polypeptides contain constant (i.e., 
10 highly conserved) and variable regions, and, within the latter, there are the 
CDRs and the so-called "framework regions" made up of amino acid 
sequences within the variable region of the heavy or light chain but outside 
the CDRs. 

1 5 The antibodies disclosed according to the invention may also be wholly 

synthetic, wherein the polypeptide chains of the antibodies are synthesized 
and, possibly, optimized for binding to the polypeptides disclosed herein as 
being receptors. Such antibodies may be chimeric or humanized antibodies 
and may be fully tetrameric in structure, or may be dimeric and comprise only 

20 a single heavy and a single light chain. Such antibodies may also include 
fragments, such as Fab and F(ab 2 )' fragments, capable of reacting with and 
binding to any of the polypeptides disclosed herein as being receptors. 

In one aspect, the present invention relates to immunoglobulins, or 
25 antibodies, as described herein, that react with, especially where they are 
specific for, the polypeptides having amino acid sequences as disclosed 
herein, preferably those having an amino acid sequence of one of SEQ ID 
NO: 19-26. Such antibodies may commonly be in the form of a composition, 
especially a pharmaceutical composition. Such antibodies, by themselves, 
30 may have therapeutic value in that they are able to bind to, and thereby tie up, 
surface sites on cancerous cells. Where such sites have some type of 
function to perform (i.e., where they are surface enzymes, or channel 
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structures, or structures that otherwise facilitate, actively or passively, the 
transport of nutrients and other vital materials to the cell. Such nutrients serve 
to facilitate the growth and replication of the cell and molecules that bind to 
such sites and thereby interfere with such activities can prove to have a 
5 therapeutic effect in that the result of such binding is to remove sources of 
nutrients from such cells, thereby interfering with growth and replication. In 
like manner, such binding may serve to remove vital enzyme activities from 
the cell's functional repertoire, thereby also interfering with viability and/or the 
ability of the cell to multiply or metastasize. In addition, by binding to such 

1 0 surface sites, the antibodies may serve to prevent the cells from reacting to 
environmental agents, such as cytokines and the like, that may facilitate 
growth, replication and metastasis, thereby further reducing the cancerous 
status of such cell and ameliorating the cancerous condition in a patient, even 
without proving fatal to the cell or cells so affected. 

1 5 The methods of the present invention also include processes wherein 

the cancer cell is contacted in vivo as well as ex vivo with an agent that 
comprises a portion, or is part of an overall molecular structure, having affinity 
for an expression product of a gene corresponding to a polynucleotide 
sequence as disclosed herein, preferably where the expression product is a 

20 cell surface structure, most preferably a polypeptide as disclosed herein, such 
as one that comprises an amino acid sequence of SEQ ID NO: 19-26. In one 
such embodiment, said portion having affinity for said expression product is 
an antibody, especially where said expression product is a polypeptide or 
oligopeptide or comprises an oligopeptide portion, or comprises a polypeptide. 

25 

In another aspect, the present invention also relates to an antibody that 
reacts with a polypeptide as disclosed herein, preferably a polypeptide 
comprising an amino acid sequence selected from the group consisting of 
SEQ ID NO: 19-26. Such an antibody may be polyclonal, monoclonal, 
30 recombinant or synthetic in origin. In one such embodiment, said antibody is 
associated, either covalently or non-covalently, with a cytotoxic agent, for 
example, an apoptotic agent. It is thus contemplated that the antibody acts a 
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targeted vector for guiding an associated therapeutic agent to a cancerous 
cell, such as a cell expressing a polypeptide homologous to, if not identical to, 
a polypeptide as disclosed herein. 

5 Where the cytotoxic agent is itself a polypeptide, said may be linked 

directly to an antibody specific for a surface target on a cancer cell, such as 
where the polypeptide represents an extension of the amino acid chain of the 
antibody. In alternative embodiments, such molecules may be covalently 
linked through a linker sequence of long or short duration, such as an amino 

10 acid sequence of 5 to 10 residues in length. Where the cytotoxic agents is 
some small organic molecule, such as a small organic compound, or some 
type of apoptotic agent, this may be covalently bonded to the antibody 
molecule or may be attached by some other type of non-covalent linkage, 
including hydrophobic and electrostatic linkages. Methods for forming such 

1 5 linkages, especially covalent linkages, are well known to those skilled in the 
art. 

The antibodies disclosed herein may also serve as targeting vectors for 
much larger structures, such as liposomes. In one such embodiment, an 

20 antibody is part of, or otherwise linked to, or associated with, a membranous 
structure, preferably a liposome or possibly some type of cellular organelle, 
which acts as a reservoir for a cytotoxic agent, such as ricin. The antibody 
then acts to target said liposome to a cancerous tissue in an animal, 
whereupon the liposome provides a source of cytotoxic agents for localized 

25 treatment of a solid tumor or other type of neoplasm. 

The present invention further encompasses an immunogenic 
composition comprising a polypeptide disclosed herein, as well as 
compositions formed using antibodies specific for these polypeptides. 

30 

Methods well known in the art for making formulations are found in, for 
example, Remington: The Science and Practice of Pharmacy, (19th ed.) Ed. 
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A.R. Gennaro, 1995, Mack Publishing Company, Easton, PA. Formulations 
for parenteral administration may, for example, contain excipients, sterile 
water, or saline, polyalkylene glycols such as polyethylene glycol, oils of 
vegetable origin, or hydrogenated napthalenes. Biocompatible, biodegradable 
5 lactide polymer, lactide/glycolide copolymer, or polyoxyethylene- 
polyoxypropylene copolymers may be used to control the release of the 
compounds. Other potentially useful parenteral delivery systems for agonists 
of the invention include ethylenevinyl acetate copolymer particles, osmotic 
pumps, implantable infusion systems, and liposomes. Formulations for 

10 inhalation may contain excipients, or example, lactose, or may be aqueous 
solutions containing, for example, polyoxyethylene-9-lauryl ether, glycocholate 
and deoxycholate, or may be oily solutions for administration in the form of 
nasal drops, or as a gel. It should be noted that, where the therapeutic agent 
to be administered is an immunoconjugate, these sometimes contain 

1 5 chemical linkages that are somewhat labile in aqueous media and therefor 
must be stored prior to administration is a more stable environment, such as 
in the form of a lyophilized powder. 

Such an agent can be a single molecular structure, comprising both 
20 affinity portion and anti-cancer activity portions, wherein said portions are 
derived from separate molecules, or molecular structures, possessing such 
activity when separated and wherein such agent has been formed by 
combining said portions into one larger molecular structure, such as where 
said portions are combined into the form of an adduct. Said anti-cancer and 
25 affinity portions may be joined covalently, such as in the form of a single 
polypeptide, or polypeptide-like, structure or may be joined non-covalently, 
such as by hydrophobic or electrostatic interactions, such structures having 
been formed by means well known in the chemical arts. Alternatively, the anti- 
cancer and affinity portions may be formed from separate domains of a single 
30 molecule that exhibits, as part of the same chemical structure, more than one 
activity wherein one of the activities is against cancer cells, or tumor formation 
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or growth, and the other activity is affinity for an expression product produced 
by expression of genes related to the cancerous process or condition. 

In one embodiment of the present invention, a chemical agent, such as 
5 a protein or other polypeptide, is joined to an agent, such as an antibody, 
having affinity for an expression product of a cancerous cell, such as a 
polypeptide or protein encoded by a gene related to the cancerous process, 
preferably a gene as disclosed herein according to the present invention, 
most preferably a polypeptide sequence disclosed herein. Thus, where the 

10 presence of said expression product is essential to tumor initiation and/or 
growth, binding of said agent to said expression product will have the effect of 
negating said tumor promoting activity. In one such embodiment, said agent is 
an apoptosis-inducing agent that induces cell suicide, thereby killing the 
cancer cell and halting tumor growth. 

15 Other genes within the cancer cell that are regulated in a manner 

similar to that of the genes disclosed herein and thus change their expression 
in a coordinated way in response to chemical compounds represent genes 
that are located within a common metabolic, signaling, physiological, or 
functional pathway so that by analyzing and identifying such commonly 

20 regulated groups of genes (groups that include the gene, or similar 
sequences, disclosed according to the invention, one can (a) assign known 
genes and novel genes to specific pathways and (b) identify specific functions 
and functional roles for novel genes that are grouped into pathways with 
genes for which their functions are already characterized or described. For 

25 example, one might identify a group of 10 genes, at least one of which is the 
gene as disclosed herein, that change expression in a coordinated fashion 
and for which the function of one, such as the polypeptide encoded by the 
sequence disclosed herein, is known then the other genes are thereby 
implicated in a similar function or pathway and may thus play a role in the 

30 cancer-initiating or cancer-facilitating process. In the same way, if a gene 
were found in normal cells but not in cancer cells, or happens to be expressed 
at a higher level in normal as opposed to cancer cells, then a similar 
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conclusion may be drawn as to its involvement in cancer, or other diseases. 
Therefore, the processes disclosed according to the present invention at once 
provide a novel means of assigning function to genes, i.e. a novel method of 
functional genomics, and a means for identifying chemical compounds that 
5 have potential therapeutic effects on specific cellular pathways. Such 
chemical compounds may have therapeutic relevance to a variety of diseases 
outside of cancer as well, in cases where such diseases are known or are 
demonstrated to involve the specific cellular pathway that is affected. 

10 The polypeptides disclosed herein, preferably those of SEQ ID NO: 19 

- 26, also find use as vaccines in that, where the polypeptide represents a 
surface protein present on a cancer cell, such polypeptide may be 
administered to an animal, especially a human being, for purposes of 
activating cytotoxic T lymphocytes (CTLs) that will be specific for, and act to 

1 5 lyze, cancer cells in said animal. Where used as vaccines, such polypeptides 
are present in the form of a pharmaceutical composition. The present 
invention may also employ polypeptides that have the same, or similar, 
immunogenic character as the polypeptides of SEQ ID NO: 19-26 and 
thereby elicit the same, or similar, immunogenic response after administration 

20 to an animal, such as an animal at risk of developing cancer, or afflicted 
therewith. Thus, the polypeptides disclosed according to the invention will 
commonly find use as immunogenic compositions. 

Expression of a gene corresponding to a polynucleotide disclosed 
25 herein, when in normal tissues, may indicate a predisposition towards 
development of prostate cancer. The encoded polypeptide might then present 
a potentially useful cell surface target for therapeutic molecules such as 
cytolytic antibodies, or antibodies attached to cytotoxic, or cytolytic, agents. . 

30 The present invention specifically contemplates use of antibodies 

against the polypeptides encoded by the polynucleotides corresponding to the 
genes disclosed herein, whereby said antibodies are conjugates to one or 
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more cytotoxic agents so that the antibodies serve to target the conjugated 
immunotoxins to a region of cancerous activity, such as a solid tumor. For 
many known cytotoxic agents, lack of selectivity has presented a drawback to 
their use as therapeutic agents in the treatment of malignancies. For example, 
5 the class of two-chain toxins, consisting of a binding subunit (or B-chain) 
linked to a toxic subunit (A-chain) are extremely cytotoxic. Thus, such agents 
as ricin, a protein isolated from castor beans, kills cells at very low 
concentrations (even less than 10" 11 M) by inactivating ribosomes in said cells 
(see, for example, Lord et al., Ricin: structure, mode of action, and some 

10 current applications. Faseb J, 8: 201-208 (1994), and Blattler et al., Realizing 
the full potential of immunotoxins. Cancer Cells, 1; 50-55 (1989)). While 
isolated A-chains of protein toxins that functionally resemble ricin A-chain are 
only weakly cytotoxic for intact cells (in the concentration range of 10" 7 to 10" 6 
M), they are very potent cytotoxic agents inside the cells. Thus, a single 

1 5 molecule of the A-subunit of diphtheria toxin can kill a cell once inside (see: 
Yamaizumi et al., One molecule of diphtheria toxin fragment A introduced into 
a cell can kill the cell. Cell, 15: 245-250, 1978). 

The present invention solves this selectivity problem by using 
antibodies specific for antigens present on cancer cells to target the cytotoxins 

20 to said cells. In addition, use of antibodies decreases toxicity because the 
antibodies are non-toxic until they reach the tumor and, because the cytotoxin 
is bound to the antibody, it is presented with less opportunity to cause 
damage to non-targeted tissues. 

25 In addition, use of such antibodies alone can provide therapeutic 

effects on the tumor through the antibody-dependent cellular cytotoxic 
response (ADCC) and complement-mediated cell lysis mechanisms. 

A number of recombinant immunotoxins (for example, consisting of Fv 
30 regions of cancer specific antibodies fused to truncated bacterial toxins) are 
well known (see, for example, Smyth et al., Specific targeting of chlorambucil 
to tumors with the use of monoclonal antibodies, J. Natl. Cancer Inst, 
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76(3):503-510 (1986); Cho et al., Single-chain Fv/folate conjugates mediate 
efficient lysis of folate-receptor-positive tumor cells, Bioconjug. Chem., 
8(3):338-346 (1997)). As noted in the literature, these may contain, for 
example, a truncated version of Pseudomonas exotoxin as a toxic moiety but 
5 the toxin is modified in such a manner that by itself it does not bind to normal 
human cells, but it retains all other functions of cytotoxicity. Here, recombinant 
antibody fragments target the modified toxin to cancer cells which are killed, 
such as by direct inhibition of protein synthesis, or by concomitant induction of 
apoptosis. Cells that are not recognized by the antibody fragment, because 

10 they do not carry the cancer antigen, are not affected. Good activity and 
specificity has been observed for many recombinant immunotoxins in in vitro 
assays using cultured cancer cells as well as in animal tumor models. 
Ongoing clinical trials provide examples where the promising pre-clinical data 
correlate with successful results in experimental cancer therapy, (see, for 

15 example, Brinkmann U. f Recombinant antibody fragments and immunotoxin 
fusions for cancer therapy, In Vivo (2000) 14:21-27). 

While the safety of employing immunoconjugates in humans has been 
established, in vivo therapeutic results have been less impressive. Because 

20 clinical use of mouse MAbs in humans is limited by the development of a 
foreign anti-globulin immune response by the human host, genetically 
engineered chimeric human-mouse MAbs have been developed by replacing 
the mouse Fc region with the human constant region. In other cases, the 
mouse antibodies have been "humanized" by replacing the framework regions 

25 of variable domains of rodent antibodies by their human equivalents. Such 
humanized and engineered antibodies can even be structurally arranged to 
have specificities and effector functions determined by design and which 
characteristics do not appear in nature. The development of bispecific 
antibodies, having different binding ends so that more than one antigenic site 

30 can be bound, have proven useful in targeting cancer cells. Thus, such 
antibody specificity has been improved by chemical coupling to various 
agents such as bacterial or plant toxins, radionuclides or cytotoxic drugs and 
other agents, (see, for example, Bodey, B. et al). Genetically engineered 
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monoclonal antibodies for direct antineoplastic treatment and cancer cell 
specific delivery of chemotherapeutic agents. Curr Pharm Des (2000) 
Feb;6(3):261-76). See also, Garnett, M. C, Targeted drug conjugates: 
principles and progress. Adv. Drug Deliv. Rev. (2001 Dec 17) 53(2): 171 -2 16; 
5 Brinkmann et al., Recombinant immunotoxins for cancer therapy. Expert Opin 
Biol Ther. (2001) 1(4):693-702. 

Among the cytotoxic agents specifically contemplated for use as 
immunoconjugates according to the present invention are Calicheamicin, a 

10 highly toxic enediyne antibiotic isolated from Micromonospora 
echinospora ssp. Calichensis, and which binds to the minor groove of 
DNA to induce double strand breaks and cell death (see: Lee et al., 
Calicheamicins, a novel family of antitumor antibiotics. 1. Chemistry and 
partial structure of calichemicin gi. J Am Chem Soc, 109: 3464-3466 (1987); 

15 Zein et al., Calicheamicin gamma 11: an antitumor antibiotic that cleaves 
double-stranded DNA site specifically, Science, 240: 1198-1201 (1988)). 
Useful derivatives of the calicheamicins include mylotarg and 138H11-Cam9. 
Mylotarg is an immunoconjugate of a humanized anti-CD33 antibody (CD33 
being found in leukemic cells of most patients with acute myeloid leukemia) 

20 and N-acetyl gamma colicheamicin dimethyl hydrazide, the latter of which is 
readily coupled to an antibody of the present invention (in place of the anti- 
CD33 but which can also be humanized by substitution of human framework 
regions into the antibody during production as described elsewhere herein) to 
form an immunoconjugate of the invention, (see: Hamann et al. Gemtuzumab 

25 Ozogamicin, A Potent and Selective Anti-CD33 Antibody-Calicheamicin 
Conjugate for Treatment of Acute Myeloid Leukemia, Bioconjug. Chem. 13, 
47-58 (2002)) For use with 138H11-Came, 138H11 is an anti-y-glutamyl 
transferase antibody coupled to theta calicheamicin through a disulfide 
linkage and found useful in vitro against cultured renal cell carcinoma cells. 

30 (see: Knoll et al., Targeted therapy of experimental renal cell carcinoma with a 
novel conjugate of monoclonal antibody 138H11 and calicheamicin 8i\ 
Cancer Res, 60: 6089-6094 (2000) The same linkage may be utilized to link 
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this cytotoxic agent to an antibody of the present invention, thereby forming a 
targeting structure for prostate cancer cells. 

Also useful in forming the immunoconjugates of the invention is DC1, a 
5 disulfide-containing analog of adozelesin, that kills cells by binding to the 
minor groove of DNA, followed by alkylation of adenine bases. Adozelesin is a 
structural analog of CC-1065, an anti-tumor antibiotic isolated from microbial 
fermentation of Streptomyces zelensis, and is about 1,000 fold more toxic to 
cultured cell lines that other DNA interacting agents, such as cis-platin and 

10 doxorubicin. This agent is readily linked to antibodies through the disulfide 
bond of adozelesin. (see: Chari et al. f Enhancement of the selectivity and 
antitumor efficacy of a CC-1065 analogue through immunoconjugate 
formation, Cancer Res, 55:4079-4084 (1995)). 

Maytansine, a highly cytotoxic microtubular inhibitor isolated from the 

1 5 shrub Maytenus serrata found to have little value in human clinical trials, is 
much more effective in its derivatized form, denoted DM1, containing a 
disulfide bond to facilitate linkage to antibodies, is up to 10-fold more cytotoxic 
(see: Chari et al., Immunoconjugates containing novel maytansinoids: 
promising anticancer drugs, Cancer Res, 52: 127-131 (1992)). These same in 

20 vitro studies showed that up to four DM1 molecules could be linked to a single 
immunoglobulin without destroying the binding affinity. Such conjugates have 
been used against breiast cancer antigens, such as the neu/HER2/erbB-2 
antigen, (see: Goldmacher et al., Immunogen, Inc., (2002) in press); also see 
Liu, C. et al., Eradication of large colon tumor xenografts by targeted delivery 

25 of maytansinoids, Proc. Natl. Acad. Sci. USA, 93, 8618-8623 (1996)). For 
example, Liu et al. (1996) describes formation of an immunoconjugate of the 
maytansinoid cytotoxin DM1 and C242 antibody, a murine lgG1 
immunoglobulin, available from Pharmacia and which has affinity for a mucin- 
like glycoprotein variably expressed by human colorectal cancers. The latter 

30 immunoconjugate was prepared according to Chari et al., Cancer Res., 
52:127-131 (1992) and was found to be highly cytotoxic against cultured colon 
cancer cells as well as showing anti-tumor effects in vivo in mice bearing 
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subcutaneous COLO 205 human colon tumor xenografts using doses well 
below the maximum tolerated dose. 

In addition, there are a variety of protein toxins (cytotoxic proteins), 
5 which include a number of different classes, such as those that inhibit protein 
synthesis: ribosome-inactivating proteins of plant origin, such as ricin, abrin, 
gelonin, and a number of others, and bacterial toxins such as pseudomonas 
exotoxin and diphtheria toxin. 

10 Another useful class is the one including taxol, taxotere, and taxoids. 

Specific examples include paclitaxel (taxol), its analog docetaxel (taxotere), 
and derivatives thereof. The first two are clinical drugs used in treating a 
number of tumors while the taxoids act to induce cell death by inhibiting the 
de-polymerization of tubulin. Such agents are readily linked to antibodies 

15 through disulfide bonds without disadvantageous effects on binding 
specificity. 

In one instance, a truncated Pseudomonas exotoxin was fused to an 
anti-CD22 variable fragment and used successfully to treat patients with 

20 chemotherapy-resistant hairy-cell leukemia, (see: Kreitman et al., Efficacy of 
the anti-CD22 recombinant immunotoxin BL22 in chemotherapy-resistant 
hairy-cell leukemia, N Engl J Med, 345. 241-247 (2001)) Conversely, the 
cancer-linked peptides of the present invention offer the opportunity to 
prepare antibodies, recombinant or otherwise, against the appropriate 

25 antigens to target solid tumors, preferably those of malignancies of prostate 
tissue, using the same or similar cytotoxic conjugates. Thus, many of the 
previously used immunoconjugates have been formed using antibodies 
against general antigenic sites linked to cancers whereas the antibodies 
formed using the peptides disclosed herein are more specific and target the 

30 antibody-cytotoxic agent to a particular tissue or organ, thus further reducing 
toxicity and other undesirable side effects. 
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In addition, the immunoconjugates formed using the antibodies 
prepared against the cancer-linked antigens disclosed herein can be formed 
by any type of chemical coupling. Thus, the cytotoxic agent of choice, along 
with the immunoglobulin, can be coupled by any type of chemical linkage, 
5 covalent or non-covalent, including electrostatic linkage, to form the 
immunoconjugates of the present invention. 

When used as immunoconjugates, the antitumor agents of the present 
invention represent a class of pro-drugs that are relatively non-toxic when first 

10 administered to an animal (due mostly to the stability of the 
immunoconjugate), such as a human patient, but which are targeted by the 
conjugated immunoglobulin to a cancer cell where they then exhibit good 
toxicity. The tumor-related, associated, or linked, antigens, preferably those 
presented herein, serve as targets for the antibodies (monoclonal, 

1 5 recombinant, and the like) specific for said antigens. The end result is the 
release of active cytotoxic agent inside the cell after binding of the 
immunoglobulin portion of the immunoconjugate. 

The cited references describe a number of useful procedures for the 
20 chemical linkage of cytotoxic agents to immunoglobulins and the disclosures 
of all such references cited herein are hereby incorporated by reference in 
their entirety. For other reviews see Ghetie et al., Immunotoxins in the therapy 
of cancer: from bench to clinic, Pharmacol Ther t 63: 209-234 (1994), Pietersz 
et al. The use of monoclonal antibody immunoconjugates in cancer therapy, 
25 Adv Exp Med Biol, 353:169-179 (1994), and Pietersz, G. A. The linkage of 
cytotoxic drugs to monoclonal antibodies for the treatment of cancer, 
Bioconjug Chem, 1:89-95 (1990). 

Thus, the present invention provides highly useful cancer-associated 
30 antigens for generation of antibodies for linkage to a number of different 
cytotoxic agents which are already known to have some in vitro toxicity and 
possess chemical groups available for linkage to antibodies. 
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Because the polypeptides disclosed herein are diagnostic for prostate 
cancer, the present invention offers a method for diagnosing prostate cancer 
in a patient comprising determining the presence of a cancer-related 
5 polypeptide in one or more prostate cells of said patient wherein said cancer- 
related polypeptide comprises a polypeptide that reacts with an antibody 
specific for a polypeptide having an amino acid sequence selected from SEQ 
ID NO: 19 - 26. In a preferred embodiment thereof, the cancer-related 
polypeptide comprises a polypeptide of SEQ ID NO: 19 - 26. In a highly 
1 0 preferred embodiment, the cancer-related polypeptide is a member selected 
from the group consisting of SEQ ID NO: 19 - 26. 

The present invention also relates to a process that comprises a 
method for producing a product, including the generation of test data, 

1 5 comprising identifying an agent according to one of the disclosed processes 
for identifying such an agent (i.e., the therapeutic agents identified according 
to the assay procedures disclosed herein) wherein said product is the data 
collected with respect to said agent as a result of said identification process, 
or assay, and wherein said data is sufficient to convey the chemical character 

20 and/or structure and/or properties of said agent. For example, the present 
invention specifically contemplates a situation whereby a user of an assay of 
the invention may use the assay to screen for compounds having the desired 
enzyme modulating activity and, having identified the compound, then 
conveys that information (i.e., information as to structure, dosage, etc) to 

25 another user who then utilizes the information to reproduce the agent and 
administer it for therapeutic or research purposes according to the invention. 
For example, the user of the assay (user 1) may screen a number of test 
compounds without knowing the structure or identity of the compounds (such 
as where a number of code numbers are used the first user is simply given 

30 samples labeled with said code numbers) and, after performing the screening 
process, using one or more assay processes of the present invention, then 
imparts to a second user (user 2), verbally or in writing or some equivalent 
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fashion, sufficient information to identify the compounds having a particular 
modulating activity (for example, the code number with the corresponding 
results). This transmission of information from user 1 to user 2 is specifically 
contemplated by the present invention. 

5 

It should be cautioned that, in carrying out the procedures of the 
present invention as disclosed herein, whether to form immunoconjugates or 
screen for other antitumor agents using the genes and polypeptides disclosed 
herein, any reference to particular buffers, media, reagents, cells, culture 

1 0 conditions and the like are not intended to be limiting, but are to be read so as 
to include all related materials that one of ordinary skill in the art would 
recognize as being of interest or value in the particular context in which that 
discussion is presented. For example, it is often possible to substitute one 
buffer system or culture medium for another and still achieve similar, if not 

1 5 identical, results. Those of skill in the art will have sufficient knowledge of 
such systems and methodologies so as to be able, without undue 
experimentation, to make such substitutions as will optimally serve their 
purposes in using the methods and procedures disclosed herein. 

20 The present invention will now be further described by way of the 

following non-limiting example. In applying the disclosure of the example, it 
should be kept clearly in mind that other and different embodiments of the 
methods disclosed according to the present invention will no doubt suggest 
themselves to those of skill in the relevant art. The following example shows 

25 how a potential antineoplastic agent may be identified using one or more of 
the genes disclosed herein. 



EXAMPLE 

30 

Determination of Gene Inhibitory Activity of an Antineoplastic Agent 
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SW480 cells are grown to a density of 10 5 cells/cm 2 in Leibovitz's L-15 
medium supplemented with 2 mM L-glutamine (90%) and 10% fetal bovine 
serum. The cells are collected after treatment with 0.25% trypsin, 0.02% 
EDTA at 37°C for 2 to 5 minutes. The trypsinized cells are then diluted with 30 
5 ml growth medium and plated at a density of 50,000 cells per well in a 96 well 
plate (100 ^I/well). The following day, cells are treated with either compound 
buffer alone, or compound buffer containing a chemical agent to be tested, for 
24 hours. The media is then removed, the cells lysed and the RNA recovered 
using the RNAeasy reagents and protocol obtained from Qiagen. RNA is 

1 0 quantitated and 10 ng of sample in 1 jil are added to 24 ul of Taqman reaction 
mix containing 1X PCR buffer, RNAsin, reverse transcriptase, nucleoside 
triphosphates, amplitaq gold, tween 20, glycerol, bovine serum albumin (BSA) 
and specific PCR primers and probes for a reference gene (18S RNA) and a 
test gene (Gene X). Reverse transcription is then carried out at 48°C for 30 

15 minutes. The sample is then applied to a Perlin Elmer 7700 sequence 
detector and heat denatured for 10 minutes at 95°C. Amplification is 
performed through 40 cycles using 15 seconds annealing at 60°C followed by 
a 60 second extension at 72°C and 30 second denaturation at 95°C. Data 
files are then captured and the data analyzed with the appropriate baseline 

20 windows and thresholds. 

The quantitative difference between the target and reference gene is 
then calculated and a relative expression value determined for all of the 
samples used. In this way, the ability of a chemotherapeutic agent to 
25 effectively and selectively reduce the activity of a cancer-specific gene is 
readily ascertained. The overall expression of the cancer-specific gene, as 
modulated by one chemical agent relative to another, is also determined. 
Chemical agents having the most effect in reducing gene activity are thereby 
identified as the most anti-neoplastic. 
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1 . A process for identifying an agent that modulates the activity of a 
cancer-related gene comprising: 
5 (a) contacting a compound with a cell containing a gene that 

corresponds to a polynucleotide having a sequence selected from the group 
consisting of SEQ ID NO: 1-18 and under conditions promoting the 
expression of said gene; and 

(b) detecting a difference in expression of said gene relative to when 
1 0 said compound is not present 

thereby identifying an agent that modulates the activity of a cancer- 
related gene. 

2 The process of claim 1 wherein said gene has a sequence selected 
1 5 from the group consisting of SEQ ID NO: 1-18. 

3. The process of claim 1 wherein the cell is a cancer cell and the 
difference in expression is a decrease in expression. 

20 4. The process of claim 3 wherein said cancer cell is a prostate cancer 

cell. 

5. A process for identifying an anti-neoplastic agent comprising 
contacting a cell exhibiting neoplastic activity with a compound first identified 

25 as a cancer related gene modulator using the process of claim 1 and 
detecting a decrease in said neoplastic activity after said contacting compared 
to when said contacting does not occur. 

6. The process of claim 5 wherein said neoplastic activity is 
30 accelerated cellular replication. 
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7. The process of claim 5 wherein said decrease in neoplastic activity 
results from the death of the cell. 



8. A process for identifying an anti-neoplastic agent comprising 
5 administering to an animal exhibiting a cancer condition an effective amount 

of an agent first identified according to the process of claim 1 and detecting a 
decrease in said cancerous condition. 

9. A process for determining the cancerous status of a cell, comprising 
1 0 determining an increase in the level of expression in said cell of a gene that 

corresponds to a polynucleotide having a sequence selected from the group 
consisting of SEQ ID NO: 1-18 wherein an elevated expression relative to a 
known non-cancerous cell indicates a cancerous state or potentially 
cancerous state. 

15 

10. The process of claim 9 wherein said elevated expression is due to 
an increased copy number. 

11. An isolated polypeptide comprising an amino acid sequence 
20 homologous to an amino acid sequence selected from the group consisting of 

SEQ ID NO: 19-26 wherein any difference between said amino acid 
sequence and the sequence of SEQ ID NO: 19 - 26 is due solely to 
conservative amino acid substitutions and wherein said isolated polypeptide 
comprises at least one immunogenic fragment. 

25 

12. An isolated polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO: 19-26. 

13. An antibody that reacts with a polypeptide comprising an amino 
30 acid sequence selected from the group consisting of SEQ ID NO: 19-26. 
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14. The antibody of claim 13 wherein said antibody is a recombinant 
antibody. 

15. The antibody of claim 13 wherein said antibody is a synthetic 
5 antibody. 

16. The antibody of claim 13 wherein said antibody is a humanized 
antibody. 

10 17. An immunoconjugate comprising the antibody of claim 13 and a 

cytotoxic agent. 

18. The antibody of claim 17 wherein said cytotoxic agent is a member 
selected from the group consisting of a calicheamicin, a maytansinoid, an 

1 5 adozelesin, a cytotoxic protein, a taxol, a taxotere, a taxoid and DC1. 

19. The immunoconjugate of claim 18 wherein said calicheamicin is 
calicheamicin N-acetyl gamma calicheamicin dimethyl hydrazide or 
calicheamicin Gi 1 . 

20 

20. The immunoconjugate of claim 18 wherein said maytansinoid is 

DM1. 

21. The immunoconjugate of claim 18 wherein said cytotoxic protein is 
25 ricin, abrin, gelonin, pseudomonas exotoxin or diphtheria toxin. 

22. The immunoconjugate of claim 18 wherein said taxol is paclitaxel. 

23. The immunoconjugate of claim 18 wherein said taxotere is 
30 docetaxel. 
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24. A process for treating cancer comprising contacting a cancerous 
cell in vivo with an agent having activity against an expression product 
encoded by a gene sequence selected from the group consisting of SEQ ID 
NO: 1 - 18. 

5 

25. The process of claim 24 wherein said agent is an antibody of claim 

t 

13. 

26. The process of claim 24 wherein said agent is an immunoconjugate 
10 of claim 17. 

27. An immunogenic composition comprising a polypeptide of claim 11. 

28. An immunogenic composition comprising a polypeptide of claim 12. 

15 

29. The process of claim 24 wherein said cancer is prostate cancer. 

30. A process for treating cancer in an animal afflicted therewith 
comprising administering to said animal an amount of an immunogenic 

20 composition of claim 27 sufficient to elicit the production of cytotoxic T 
lymphocytes specific for the polypeptide of claim 1 1 . 

31. A process for treating cancer in an animal afflicted therewith 
comprising administering to said animal an amount of an immunogenic 

25 composition of claim 28 sufficient to elicit the production of cytotoxic T 
lymphocytes specific for the polypeptide of claim 12. 

32. A process for treating a cancerous condition in an animal afflicted 
therewith comprising administering to said animal a therapeutically effective 

30 amount of an agent first identified as having anti-neoplastic activity using the 
process of claim 8. 
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33. A process for protecting an animal against cancer comprising 
administering to an animal at risk of developing cancer a therapeutically 
effective amount of an agent first identified as having antineoplastic activity 
using the process of claim 8. 

5 

34. The process of claim 30 wherein said animal is a human being. 

35. The process of claim 30 wherein said cancer is prostate cancer. 

10 36. A method for diagnosing prostate cancer in a patient comprising 

determining the presence of a cancer-related polypeptide in one or more 
prostate cells of said patient wherein said cancer-related polypeptide 
comprises a polypeptide that reacts with an antibody specific for a polypeptide 
having an amino acid sequence selected from SEQ ID NO: 19-26. 

15 

37. The method of claim 36 wherein said cancer-related polypeptide 
comprises a polypeptide of SEQ ID NO: 19-26. 

38. The method of claim 36 wherein said cancer-related polypeptide is 
20 a member selected from the group consisting of SEQ ID NO: 19-26. 

39. A method for producing test data with respect to the gene 
modulating activity of a compound comprising: 

(a) contacting a compound with a cell containing a polynucleotide 
25 comprising a nucleotide sequence corresponding to a gene whose expression 

is increased in a cancerous cell over that in a non-cancerous cell and under 
conditions wherein said polynucleotide is being expressed, 

(b) determining a change in expression of polynucleotides as a result of 
said contacting, and 

30 (c) producing test data with respect to the gene modulating activity of 

said compound based on a decrease in the expression of the determined 
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gene whose expression is otherwise increased in a cancerous cell over that in 
a non-cancerous cell indicating gene modulating activity. 
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SEQUENCE LISTING 

<110> Avalon Pharmaceuticals 

<120> Cancer-Linked Gene as Target for Chemotherapy 

<130> 689290-166 

<140> 
<141> 

<150> US/60/386, 651 

<151> 2002-06-06 

<160> 26 



<210> 1 

<211> 5271 

<212> DNA 

<213> Homo sapiens 

<400> 1 

ggacaggcgt ggcggccgga gccccagcat ccctgcttga ggtccaggag cggagcccgc 60 

ggccaccgcc gcctgatcag cgcgaccccg gcccgcgccc gccccgcccg gcaagatgct 120 

gcccgtgtac caggaggtga agcccaaccc gctgcaggac gcgaacatct gctcacgcgt 180 

gttcttctgg tggctcaatc ccttgtttaa aattggccat aaacggagat tagaggaaga 240 

tgatatgtat tcagtgctgc cagaagaccg ctcacagcac cttggagagg agttgcaagg 300 

gttctgggat aaagaagttt taagagctga gaatgacgca cagaagcctt ctttaacaag 360 

agcaatcata aagtgttact ggaaatctta tttagttttg ggaattttta cgttaattga 420 

ggaaagtgcc aaagtaatcc agcccatatt tttgggaaaa attattaatt attttgaaaa 480 

ttatgatccc atggattctg tggctttgaa cacagcgtac gcctatgcca cggtgctgac 540 

tttttgcacg ctcattttgg ctatactgca tcacttatat ttttatcacg ttcagtgtgc 600 

tgggatgagg ttacgagtag ccatgtgcca tatgatttat cggaaggcac ttcgtcttag 660 

taacatggcc atggggaaga caaccacagg ccagatagtc aatctgctgt ccaatgatgt 720 

gaacaagttt gatcaggtga cagtgttctt acacttcctg tgggcaggac cactgcaggc 780 

gatcgcagtg actgccctac tctggatgga gataggaata tcgtgccttg ctgggatggc 840 

agttctaatc attctcctgc ccttgcaaag ctgttttggg aagttgttct catcactgag 900 

gagtaaaact gcaactttca cggatgccag gatcaggacc atgaatgaag ttataactgg 960 

tataaggata ataaaaatgt acgcctggga aaagtcattt tcaaatctta ttaccaattt 1020 

gagaaagaag gagatttcca agattctgag aagttcctgc ctcaggggga tgaatttggc 1080 

ttcgtttttc agtgcaagca aaatcatcgt gtttgtgacc ttcaccacct acgtgctcct 1140 

cggcagtgtg atcacagcca gccgcgtgtt cgtggcagtg acgctgtatg gggctgtgcg 1200 

gctgacggtt accctcttct tcccctcagc cattgagagg gtgtcagagg caatcgtcag 1260 

catccgaaga atccagacct ttttgctact tgatgagata tcacagcgca accgtcagct 1320 

gccgtcagat ggtaaaaaga tggtgcatgt gcaggatttt actgcttttt gggataaggc 1380 

atcagagacc ccaactctac aaggcctttc ctttactgtc agacctggcg aattgttagc 1440 

tgtggtcggc cccgtgggag cagggaagtc atcactgtta agtgccgtgc tcggggaatt 1500 

ggccccaagt cacgggctgg tcagcgtgca tggaagaatt gcctatgtgt ctcagcagcc 1560 

ctgggtgttc tcgggaactc tgaggagtaa tattttattt gggaagaaat atgaaaagga 1620 

acgatatgaa aaagtcataa aggcttgtgc tctgaaaaag gatttacagc tgttggagga 1680 

tggtgatctg actgtgatag gagatcgggg aaccacgctg agtggagggc agaaagcacg 1740 

ggtaaacctt gcaagagcag tgtatcaaga tgctgacatc tatctcctgg acgatcctct 1800 

cagtgcagta gatgcggaag ttagcagaca cttgttcgaa ctgtgtattt gtcaaatttt 1860 

gcatgagaag atcacaattt tagtgactca tcagttgcag tacctcaaag ctgcaagtca 1920 

gattctgata ttgaaagatg gtaaaatggt gcagaagggg acttacactg agttcctaaa 1980 

atctggtata gattttggct cccttttaaa gaaggataat gaggaaagtg aacaacctcc 2040 
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agttccagga actcccacac taaggaatcg taccttctca gagtcttcgg tttggtctca 2100 

acaatcttct agaccctcct tgaaagatgg tgctctggag agccaagata cagagaatgt 2160 

cccagttaca ctatcagagg agaaccgttc tgaaggaaaa gttggttttc aggcctataa 2220 

gaattacttc agagctggtg ctcactggat tgtcttcatt ttccttattc tcctaaacac 2280 

tgcagctcag gttgcctatg tgcttcaaga ttggtggctt tcatactggg caaacaaaca 2340 

aagtatgcta aatgtcactg taaatggagg aggaaatgta accgagaagc tagatcttaa 2400 

ctggtactta ggaatttatt caggtttaac tgtagctacc gttctttttg gcatagcaag 2460 

atctctattg gtattctacg tccttgttaa ctcttcacaa actttgcaca acaaaatgtt 2520 

tgagtcaatt ctgaaagctc cggtattatt ctttgataga aatccaatag gaagaatttt 2580 

aaatcgtttc tccaaagaca ttggacactt ggatgatttg ctgccgctga cgtttttaga 2640 

tttcatccag acattgctac aagtggttgg tgtggtctct gtggctgtgg ccgtgattcc 2700 

ttggatcgca atacccttgg ttccccttgg aatcattttc atttttcttc ggcgatattt 2760 

tttggaaacg tcaagagatg tgaagcgcct ggaatctaca actcggagtc cagtgttttc 2820 

ccacttgtca tcttctctcc aggggctctg gaccatccgg gcatacaaag cagaagagag 2880 

gtgtcaggaa ctgtttgatg cacaccagga tttacattca gaggcttggt tcttgttttt 294 0 

gacaacgtcc cgctggttcg ccgtccgtct ggatgccatc tgtgccatgt ttgtcatcat 3000 

cgttgccttt gggtccctga ttctggcaaa aactctggat gccgggcagg ttggtttggc 3060 

actgtcctat gccctcacgc tcatggggat gtttcagtgg tgtgttcgac aaagtgctga 3120 

agttgagaat atgatgatct cagtagaaag ggtcattgaa tacacagacc ttgaaaaaga 3180 

agcaccttgg gaatatcaga aacgcccacc accagcctgg ccccatgaag gagtgataat 3240. 

ctttgacaat gtgaacttca tgtacagtcc aggtgggcct ctggtactga agcatctgac 3300 

agcactcatt aaatcacaag aaaaggttgg cattgtggga agaaccggag ctggaaaaag 3360 

ttccctcatc tcagcccttt ttagattgtc agaacccgaa ggtaaaattt ggattgataa 3420 

gatcttgaca actgaaattg gacttcacga tttaaggaag aaaatgtcaa tcatacctca 3480 

ggaacctgtt ttgttcactg gaacaatgag gaaaaacctg gatcccttta atgagcacac 3540 

ggatgaggaa ctgtggaatg ccttacaaga ggtacaactt aaagaaacca ttgaagatct 3600 

tcctggtaaa atggatactg aattagcaga atcaggatcc aattttagtg ttggacaaag 3660 

acaactggtg tgccttgcca gggcaattct caggaaaaat cagatattga ttattgatga 3720 

agcgacggca aatgtggatc caagaactga tgagttaata caaaaaaaaa tccgggagaa 3780 

atttgcccac tgcaccgtgc taaccattgc acacagattg aacaccatta ttgacagcga 3840 

caagataatg gttttagatt caggaagact gaaagaatat gatgagccgt atgttttgct 3900 

gcaaaataaa gagagcctat tttacaagat ggtgcaacaa ctgggcaagg cagaagccgc 3960 

tgccctcact gaaacagcaa aacaggtata cttcaaaaga aattatccac atattggtca 4020 

cactgaccac atggttacaa acacttccaa tggacagccc tcgaccttaa ctattttcga 4080 

gacagcactg tgaatccaac caaaatgtca agtccgttcc gaaggcattt tccactagtt 4140 

tttggactat gtaaaccaca ttgtactttt ttttactttg gcaacaaata tttatacata 4200 

caagatgcta gttcatttga atatttctcc caacttatcc aaggatctcc agctctaaca 4260 

aaatggttta tttttattta aatgtcaata gttgtttttt aaaatccaaa tcagaggtgc 4320 

aggccaccag ttaaatgccg tctatcaggt tttgtgcctt aagagactac agagtcaaag 4380 

ctcattttta aaggagtagg acaaagttgt cacaggtttt tgttgttgtt tttattgccc 4440 

ccaaaattac atgttaattt ccatttatat cagggattct atttacttga agactgtgaa 4500 

gttgccattt tgtctcattg ttttctttga cataactagg atccattatt tcccctgaag 4560 

gcttcttgtt agaaaatagt acagttacaa ccaataggaa caacaaaaag aaaaagtttg 4 620 

tgacattgta gtagggagtg tgtacccctt actccccatc aaaaaaaaaa atggatacat 4680 

ggttaaagga tagaagggca ataftttatc atatgttcta aaagagaagg aagagaaaat 4740 

actactttct caaaatggaa gcccttaaag gtgctttgat actgaaggac acaaatgtga 4800 

ccgtccatcc tcctttagag ttgcatgact tggacacggt aactgttgca gttttagact 4860 

cagcattgtg acacttccca agaaggccaa acctctaacc gacattcctg aaatacgtgg 4920 

cattattctt ttttggattt ctcatttatg gaaggctaac cctctgttga ccgtaagcct 4980 

tttggtttgg gctgtattga aatcctttct aaattgcatg aataggctct gctaacgtga 5040 

tgagacaaac tgaaaattat tgcaagcatt gactataatt atgcagtacg ttctcaggat 5100 

gcatccaggg gttcattttc atgagcctgt ccaggttagt ttactcctga ccactaatag 5160 

cattgtcatt tgggctttct gttgaatgaa tcaacaaacc acaatacttc ctgggacctt 5220 

ttgtacttta tttgaactat gagtctgtat taactgcttg gggtaccgag c 5271 



<210> 2 
<211> 5284 
<212> DNA 
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<213> Homo sapiens 
<400> 2 

ggacaggcgt ggcggccgga gccccagcat ccctgcttga ggtccaggag cggagcccgc 60 

ggccaccgcc gcctgatcag cgcgaccccg gcccgcgccc gccccgcccg gcaagatgct 120 

gcccgtgtac caggaggtga agcccaaccc gctgcaggac gcgaacatct gctcacgcgt 180 

gttcttctgg tggctcaatc ccttgtttaa aattggccat aaacggagat tagaggaaga 240 

tgatatgtat tcagtgctgc cagaagaccg ctcacagcac cttggagagg agttgcaagg 300 

gttctgggat aaagaagttt taagagctga gaatgacgca cagaagcctt ctttaacaag 360 

agcaatcata aagtgttact ggaaatctta tttagttttg ggaattttta cgttaattga 420 

ggaaagtgcc aaagtaatcc agcccatatt tttgggaaaa attattaatt attttgaaaa 480 

ttatgatccc atggattctg tggctttgaa cacagcgtac gcctatgcca cggtgctgac 540 

tttttgcacg ctcattttgg ctatactgca tcacttatat ttttatcacg ttcagtgtgc 600 

tgggatgagg ttacgagtag ccatgtgcca tatgatttat cggaaggcac ttcgtcttag 660 

taacatggcc atggggaaga caaccacagg ccagatagtc aatctgctgt ccaatgatgt 720 

gaacaagttt gatcaggtga cagtgttctt acacttcctg tgggcaggac cactgcaggc 780 

gatcgcagtg actgccctac tctggatgga gataggaata tcgtgccttg ctgggatggc 84 0 

agttctaatc attctcctgc ccttgcaaag ctgttttggg aagttgttct catcactgag 900 

gagtaaaact gcaactttca cggatgccag gatcaggacc atgaatgaag ttataactgg 960 

tataaggata ataaaaatgt acgcctggga aaagtcattt tcaaatctta ttaccaattt 1020 

gagaaagaag gagatttcca agattctgag aagttcctgc ctcaggggga tgaatttggc 1080 

ttcgtttttc agtgcaagca aaatcatcgt gtttgtgacc, ttcaccacct acgtgctcct 1140 

cggcagtgtg atcacagcca gccgcgtgtt cgtggcagtg acgctgtatg gggctgtgcg 1200 

gctgacggtt accctcttct tcccctcagc cattgagagg gtgtcagagg caatcgtcag 1260 

catccgaaga atccagacct ttttgctact tgatgagata tcacagcgca accgtcagct 1320 

gccgtcagat ggtaaaaaga tggtgcatgt gcaggatttt actgcttttt gggataaggc 1380 

atcagagacc ccaactctac aaggcctttc ctttactgtc agacctggcg aattgttagc 1440 

tgtggtcggc cccgtgggag cagggaagtc atcactgtta agtgccgtgc tcggggaatt 1500 

ggccccaagt cacgggctgg tcagcgtgca tggaagaatt gcctatgtgt ctcagcagcc 1560 

ctgggtgttc tcgggaactc tgaggagtaa tattttattt gggaagaaat atgaaaagga 1620 

acgatatgaa aaagtcataa aggcttgtgc tctgaaaaag gatttacagc tgttggagga 1680 

tggtgatctg actgtgatag gagatcgggg aaccacgctg agtggagggc agaaagcacg 1740 

ggtaaacctt gcaagagcag tgtatcaaga tgctgacatc tatctcctgg acgatcctct 1800 

cagtgcagta gatgcggaag ttagcagaca cttgttcgaa ctgtgtattt gtcaaatttt 1860 

gcatgagaag atcacaattt tagtgactca tcagttgcag tacctcaaag ctgcaagtca 1920 

gattctgata ttgaaagatg gtaaaatggt gcagaagggg acttacactg agttcctaaa 1980 

atctggtata gattttggct cccttttaaa gaaggataat gaggaaagtg aacaacctcc 2040 

agttccagga actcccacac taaggaatcg taccttctca gagtcttcgg tttggtctca 2100 

acaatcttct agaccctcct tgaaagatgg tgctctggag agccaagata cagagaatgt 2160 

cccagttaca ctatcagagg agaaccgttc tgaaggaaaa gttggttttc aggcctataa 2220 

gaattacttc agagctggtg ctcactggat tgtcttcatt ttccttattc tcctaaacac 2280 

tgcagctcag gttgcctatg tgcttcaaga ttggtggctt tcatactggg caaacaaaca 2340 

aagtatgcta aatgtcactg taaatggagg aggaaatgta accgagaagc tagatcttaa 2400 

ctggtactta ggaatttatt caggtttaac tgtagctacc gttctttttg gcatagcaag 24 60 

atctctattg gtattctacg tccttgttaa ctcttcacaa actttgcaca acaaaatgtt 2520 

tgagtcaatt ctgaaagctc cggtattatt ctttgataga aatccaatag gaagaatttt 2580 

aaatcgtttc tccaaagaca ttggacactt ggatgatttg ctgccgctga cgtttttaga 2640 

tttcatccag acattgctac aagtggttgg tgtggtctct gtggctgtgg ccgtgattcc 2700 

ttggatcgca atacccttgg ttccccttgg aatcattttc atttttcttc ggcgatattt 2760 

tttggaaacg tcaagagatg tgaagcgcct ggaatctaca actcggagtc cagtgttttc 2820 

ccacttgtca tcttctctcc aggggctctg gaccatccgg gcatacaaag cagaagagag 2880 

gtgtcaggaa ctgtttgatg cacaccagga tttacattca gaggcttggt tcttgttttt 2940 

gacaacgtcc cgctggttcg ccgtccgtct ggatgccatc tgtgccatgt ttgtcatcat 3000 

cgttgccttt gggtccctga ttctggcaaa aactctggat gccgggcagg ttggtttggc 3060 

actgtcctat gccctcacgc tcatggggat gtttcagtgg tgtgttcgac aaagtgctga 3120 

agttgagaat atgatgatct cagtagaaag ggtcattgaa tacacagacc ttgaaaaaga 3180 

agcaccttgg gaatatcaga aacgcccacc accagcctgg ccccatgaag gagtgataat 3240 

ctttgacaat gtgaacttca tgtacagtcc aggtgggcct ctggtactga agcatctgac 3300 

agcactcatt aaatcacaag aaaaggttgg cattgtggga agaaccggag ctggaaaaag 3360 
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ttccctcatc tcagcccttt ttagattgtc agaacccgaa ggtaaaattt ggattgataa 3420 

gatcttgaca actgaaattg gacttcacga tttaaggaag aaaatgtcaa tcatacctca 3480 

ggaacctgtt ttgttcactg gaacaatgag gaaaaacctg gatcccttta atgagcacac 3540 

ggatgaggaa ctgtggaatg ccttacaaga ggtacaactt aaagaaacca ttgaagatct 3600 

tcctggtaaa atggatactg aattagcaga atcaggatcc aattttagtg ttggacaaag 3660 

acaactggtg tgccttgcca gggcaattct caggaaaaat cagatattga ttattgatga 3720 

agcgacggca aatgtggatc caagaactga tgagttaata caaaaaaaaa tccgggagaa 3780 

atttgcccac tgcaccgtgc taaccattgc acacagattg aacaccatta ttgacagcga 3840 

caagataatg gttttagatt caggaagact gaaagaatat gatgagccgt atgttttgct 3900 

gcaaaataaa gagagcctat tttacaagat ggtgcaacaa ctgggcaagg cagaagccgc 3960 

tgccctcact gaaacagcaa aacaggtata cttcaaaaga aattatccac atattggtca 4 020 

cactgaccac atggttacaa acacttccaa tggacagccc tcgaccttaa ctattttcga 4080 

gacagcactg tgaatccaac caaaatgtca agtccgttcc gaaggcattt tccactagtt 4140 

tttggactat gtaaaccaca ttgtactttt ttttactttg gcaacaaata tttatacata 4200 

caagatgcta gttcatttga atatttctcc caacttatcc aaggatctcc agctctaaca 4260 

aaatggttta tttttattta aatgtcaata gttgtttttt aaaatccaaa tcagaggtgc 4320 

aggccaccag ttaaatgccg tctatcaggt tttgtgcctt aagagactac agagtcaaag 4380 

ctcattttta aaggagtagg acaaagttgt cacaggtttt tgttgttgtt tttattgccc 4440 

ccaaaattac atgttaattt ccatttatat cagggattct atttacttga agactgtgaa A ^ nn 

gttgccattt tgtctcattg ttttctttga cataactagg atccattatt tcccctgaag 

gcttcttgtt agaaaatagt acagttacaa ccaataggaa caacaaaaag aaaaagtttg 

tgacattgta gtagggagtg tgtacccctt actccccatc aaaaaaaaaa atggatacat 

ggttaaagga tagaagggca atattttatc atatgttcta aaagagaagg aagagaaaat 4740 

actactttct caaaatggaa gcccttaaag gtgctttgat actgaaggac acaaatgtga 4 800 

ccgtccatcc tcctttagag ttgcatgact tggacacggt aactgttgca gttttagact 



4500 
4560 
4620 
4680 



4860 



cagcattgtg acacttccca agaaggccaa acctctaacc gacattcctg aaatacgtgg 4 920 

cattattctt ttttggattt ctcatttatg gaaggctaac cctctgttga ccgtaagcct 4980 

tttggtttgg gctgtattga aatcctttct aaattgcatg aataggctct tggcttaaac 5040 

ccggttgaat gagacaaact gagacaactt taaatttggc ccaaaggcca aactttgtcg 5100 

ggaacgccct ttttccgtgc acaacaccga cgacccccac ccttttctac aaacccaaac 5160 

aacgtgtaca acctttcggg ccccaacgca cagcccctcc caccgcgccg cccacgaccg 5220 

cactaacgcc ccggccactt ctacaactcc cctctccccc acaaggggcg gacaacttgg 5280 

ggca 5284 



<210> 3 

<211> 3521 

<212> DNA 

<213> Homo sapiens 



<400> 3 

ggacaggcgt ggcggccgga gccccagcat ccctgcttga ggtccaggag cggagcccgc 60 
ggccaccgcc gcctgatcag cgcgaccccg gcccgcgccc gccccgcccg gcaagatgct 120 
gcccgtgtac caggaggtga agcccaaccc gctgcaggac gcgaacatct gctcacgcgt 180 
gttcttctgg tggctcaatc ccttgtttaa aattggccat aaacggagat tagaggaaga 240 
tgatatgtat tcagtgctgc cagaagaccg ctcacagcac cttggagagg agttgcaagg 300 
gttctgggat aaagaagttt taagagctga gaatgacgca cagaagcctt ctttaacaag 360 
agcaatcata aagtgttact ggaaatctta tttagttttg ggaattttta cgttaattga 420 
ggaaagtgcc aaagtaatcc agcccatatt tttgggaaaa attattaatt attttgaaaa 480 
ttatgatccc atggattctg tggctttgaa cacagcgtac gcctatgcca cggtgctgac 540 

600 
660 



tttttgcacg ctcattttgg ctatactgca tcacttatat ttttatcacg ttcagtgtgc 

tgggatgagg ttacgagtag ccatgtgcca tatgatttat cggaaggcac ttcgtcttag 

taacatggcc atggggaaga caaccacagg ccagatagtc aatctgctgt ccaatgatgt 720 

gaacaagttt gatcaggtga cagtgttctt acacttcctg tgggcaggac cactgcaggc 780 

gatcgcagtg actgccctac tctggatgga gataggaata tcgtgccttg ctgggatggc 840 

agttctaatc attctcctgc ccttgcaaag ctgttttggg aagttgttct catcactgag 900 

gagtaaaact gcaactttca cggatgccag gatcaggacc atgaatgaag ttataactgg 960 

tataaggata ataaaaatgt acgcctggga aaagtcattt tcaaatctta ttaccaattt 1020 

gagaaagaag gagatttcca agattctgag aagttcctgc ctcaggggga tgaatttggc 1080 
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ttcgtttttc 
cggcagtgtg 
gctgacggtt 
catccgaaga 
gccgtcagat 
atcagagacc 
tgtggtcggc 
ggccccaagt 
ctgggtgttc 
acgatatgaa 
tggtgatctg 
ggtaaacctt 
cagtgcagta 
gcatgagaag 
gattctgata 
atctggtata 
agttccagga 
acaatcttct 
cccagttaca 
gaattacttc 
tgcagctcag 
aagtatgcta 
ctggtactta 
atctctattg 
tgagtcaatt 
aaatcgtttc 
tttcatccag 
ttggatcgca 
tttggaaacg 
ccacttgtca 
gtgtcaggaa 
tggtttcaaa 
tgttggcagg 
cttccccatc 
ctttgtgccc 
gtcagcaaac 
attgtctctg 
ttggcaaatg 
cacgtgctgt 
agaatttcca 
tgattaaatg 



agtgcaagca 
atcacagcca 
accctcttct 
atccagacct 
ggtaaaaaga 
ccaactctac 
cccgtgggag 
cacgggctgg 
tcgggaactc 
aaagtcataa 
actgtgatag 
gcaagagcag 
gatgcggaag 
atcacaattt 
ttgaaagatg 
gattttggct 
actcccacac 
agaccctcct 
ctatcagagg 
agagctggtg 
gttgcctatg 
aatgtcactg 
ggaatttatt 
gtattctacg 
ctgaaagctc 
tccaaagaca 
acattgctac 
atacccttgg 
tcaagagatg 
tcttctctcc 
ctgtttgatg 
ggacaggatc 
cacctcaggg 
cccttcttct 
cctgactggc 
gttttctgca 
ttgtgattaa 
gctgtgacta 
agtttgctga 
ccaaatctac 
gcactttaaa 



aaatcatcgt 
gccgcgtgtt 
tcccctcagc 
ttttgctact 
tggtgcatgt 
aaggcctttc 
cagggaagtc 
tcagcgtgca 
tgaggagtaa 
aggcttgtgc 
gagatcgggg 
tgtatcaaga 
ttagcagaca 
tagtgactca 
gtaaaatggt 
cccttttaaa 
taaggaatcg 
tgaaagatgg 
agaaccgttc 
ctcactggat 
tgcttcaaga 
taaatggagg 
caggtttaac 
tccttgttaa 
cggtattatt 
ttggacactt 
aagtggttgg 
ttccccttgg 
tgaagcgcct 
aggggctctg 
cacaccagga 
tgctgctttt 
cccagcccca 
gggcttggac 
ttgtggggtc 
aaaggccacc 
tcacctttgc 
tgtttcaata 
ccccttgcat 
aaaattcaga 
aaccacatgc 



gtttgtgacc 
cgtggcagtg 
cattgagagg 
tgatgagata 
gcaggatttt 
ctttactgtc 
atcactgtta 
tggaagaatt 
tattttattt 
tctgaaaaag 
aaccacgctg 
tgctgacatc 
cttgttcgaa 
tcagttgcag 
gcagaagggg 
gaaggataat 
taccttctca 
tgctctggag 
tgaaggaaaa 
tgtcttcatt 
ttggtggctt 
aggaaatgta 
tgtagctacc 
ctcttcacaa 
ctttgataga 
ggatgatttg 
tgtggtctct 
aatcattttc 
ggaatctaca 
gaccatccgg 
tttacattca 
tctggcttaa 
catcctgctt 
tcctctcgga 
tttctctcac 
tagtaagtgt 
cattgtagca 
aaactttatt 
cagtgaagat 
tccgtcattc 
taaaaggaaa 



ttcaccacct 
acgctgtatg 
gtgtcagagg 
tcacagcgca 
actgcttttt 
agacctggcg 
agtgccgtgc 
gcctatgtgt 
gggaagaaat 
gatttacagc 
agtggagggc 
tatctcctgg 
ctgtgtattt 
tacctcaaag 
acttacactg 
gaggaaagtg 
gagtcttcgg 
agccaagata 
gttggttttc 
ttccttattc 
tcatactggg 
accgagaagc 
gttctttttg 
actttgcaca 
aatccaatag 
ctgccgctga 
gtggctgtgg 
atttttcttc 
actcggagtc 
gcatacaaag 
ggtctgtcta 
cctccctggt 
tttttgcgca 
atcgctgctg 
catgcatggc 
tttaggcatt 
caaaagcaga 
tacaaaaata 
tttttcaata 
caatctttat 
a 



acgtgctcct 
gggctgtgcg 
caatcgtcag 
accgtcagct 
gggataaggc 
aattgttagc 
tcggggaatt 
ctcagcagcc 
atgaaaagga 
tgttggagga 
agaaagcacg 
acgatcctct 
gtcaaatttt 
ctgcaagtca 
agttcctaaa 
aacaacctcc 
tttggtctca 
cagagaatgt 
aggcctataa 
tcctaaacac 
caaacaaaca 
tagatcttaa 
gcatagcaag 
acaaaatgtt 
gaagaatttt 
cgtttttaga 
ccgtgattcc 
ggcgatattt 
cagtgttttc 
cagaagagag 
tttctggaaa 
cagatgagct 
tcttcgtctc 
cacgttccct 
ctgtggctct 
gtgtgccatt 
tgtcaataca 
ggtgctggct 
tggaaatttc 
tttatcttgc 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3521 



<210> 4 

<211> 3284 

<212> DNA 

<213> Homo sapiens 



<400> 4 

caagcagggc 

aacaaggaga 

tttagtggtt 

tgaaaccata 

aattaaatct 

tacagtgttg 

attaatgtca 

agaaggagcc 

tagctctgga 

tgtttcagtg 



atgagttcac 
gtagtatttt 
tccagctgtg 
ataatgaatg 
atgtcgatcc 
ttttccccat 
ttaaatgtga 
aggcttaggt 
tgccgggcag 
gtgtgttcga 



caccccagta 
agtctcaaaa 
gttgcattgc 
tctgaataat 
tgttggaatt 
tgtatcttga 
attgtttcaa 
ttgaagtgag 
gttggtttgg 
caaagtgctg 



acaacagtgc 
gcattggcac 
acgtagaaag 
gacattattt 
cataaaatca 
tctcaagcaa 
cattatgaag 
actgacttta 
cactgtccta 
aagttgagaa 



attacaatac 
ttacaaccta 
tggaataatg 
cttgcgtttg 
tctaaaaat t 
caaatggtaa 
ggttcctctt 
ttcccttctt 
tgccctcacg 
tatgatgatc 



ttaataatgg 
ttgctttgtt 
taatgggctt 
taatactgtt 
tttctaaata 
aagtatagct 
ggtaagtggc 
cctccttacg 
ctcatgggga 
tcagtagaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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gggtcattga 
caccagcctg 
caggtgggcc 
gcattgtggg 
cagaacccga 
atttaaggaa 
ggaaaaacct 
aggtacaact 
aatcaggatc 
tcaggaaaaa 
atgagttaat 
cacacagatt 
tgaaagaata 
tggtgcaaca 
acttcaaaag 
atggacagcc 
aagtccgttc 
tttttacttt 
ccaacttatc 
agttgttttt 
ttttgtgcct 
tcacaggttt 
tcagggattc 
acataactag 
accaatagga 
tactccccat 
catatgttct 
ggtgctttga 
ttggacacgg 
aacctctaac 
ggaaggctaa 
taaattgcat 
tgactataat 
tccaggttag 
atcaacaaac 
atttttcctg 
tatggtttga 
atatatgtat 
tatagcaagt 
aaaagcacac 
tcctagtact 
atgtttatta 
tgaggtttgt 
ggttaaagac 
atgaaactaa 



atacacagac 

gccccatgaa 

tctggtactg 

aagaaccgga 

aggtaaaatt 

gaaaatgtca 

ggatcccttt 

taaagaaacc 

caattttagt 

tcagatattg 

acaaaaaaaa 

gaacaccatt 

tgatgagccg 

actgggcaag 

aaattatcca 

ctcgacctta 

cgaaggcatt 

ggcaacaaat 

caaggatctc 

taaaatccaa 

taagagacta 

ttgttgttgt 

tatttacttg 

gatccattat 

acaacaaaaa 

caaaaaaaaa 

aaaagagaag 

tactgaagga 

taactgttgc 

cgacattcct 

ccctctgttg 

gaataggctc 

tatgcagtac 

tttactcctg 

cacaatactt 

atgatggtgg 

agtggagtct 

atagacatgc 

tttttgctca 

accacacaca 

tagaaataca 

atagtcatgg 

taaaacgata 

agtcatataa 

attcaattga 



cttgaaaaag 

ggagtgataa 

aagcatctga 

gctggaaaaa 

tggattgata 

atcatacctc 

aatgagcaca 

attgaagatc 

gttggacaaa 

attattgatg 

atccgggaga 

attgacagcg 

tatgttttgc 

gcagaagccg 

catattggtc 

actattttcg 

ttccactagt 

atttatacat 

cagctctaac 

atcagaggtg 

cagagtcaaa 

ttttattgcc 

aagactgtga 

ttcccctgaa 

gaaaaagttt 

aatggataca 

gaagagaaaa 

cacaaatgtg 

agttttagac 

gaaatacgtg 

accgtaagcc 

tgctaacgtg 

gttctcagga 

accactaata 

cctgggacct 

ctgtaatatg 

catgacctct 

acacgtgtgc 

tcagcagaga 

gctaactgcc 

cgtatggtta 

ttcgtatttt 

gctatattca 

acgtcctgtt 

aataaatgat 



aagcaccttg 

tctttgacaa 

cagcactcat 

gttccctcat 

agatcttgac 

aggaacctgt 

cggatgagga 

ttcctggtaa 

gacaactggt 

aagcgacggc 

aatttgccca 

acaagataat 

tgcaaaataa 

ctgccctcac 

acactgacca 

agacagcact 

ttttggacta 

acaagatgct 

aaaatggttt 

caggccacca 

gctcattttt 

cccaaaatta 

agttgccatt 

ggcttcttgt 

gtgacattgt 

tggttaaagg 

tactactttc 

accgtccatc 

tcagcattgt 

gcattattct 

ttttggtttg 

atgagacaaa 

tgcatccagg 

gcattgtcat 

tttgtacttt 

ttgagttcag 

cagaataagg 

at.ttgtttgt 

gcaacagatg 

aaaatacatt 

atgttcagtc 

aggtgactga 

aaatgtctat 

tctgttttaa 

agttttcatc 



ggaatatcag 

tgtgaacttc 

taaatcacaa 

ctcagccctt 

aactgaaatt 

tttgttcact 

actgtggaat 

aatgigatact 

gtgccttgcc 

aaatgtggat 

ctgcaccgtg 

ggttttagat 

agagagccta 

tgaaacagca 

catggttaca 

gtgaatccaa 

tgtaaaccac 

agttcatttg 

atttttattt 

gttaaatgcc 

aaaggagtag 

catgttaatt 

ttgtctcatt 

tagaaaatag 

agtagggagt 

atagaagggc 

tcaaaatgga 

ctcctttaga 

gacacttccc 

tttttggatt 

ggctgtattg 

ctgaaaatta 

ggttcatttt 

ttgggctttc 

atttgaacta 

tttactaaag 

tgtcacctcc 

atacatatat 

ttttattgag 

gaccgtagta 

caacaaacca 

aattgcaaca 

atgtttattt 

tgttatcata 

tcca 



aaacgcccac 
atgtacagtc 
gaaaaggttg 
tttagattgt 
ggacttcacg 
ggaacaatga 
gccttacaag 
gaattagcag 
agggcaattc 
ccaagaactg 
ctaaccattg 
tcaggaagac 
ttttacaaga 
aaacaggtat 
aacacttcca 
ccaaaatgtc 
attgtacttt 
aatatttctc 
aaatgtcaat 
gtctatcagg 
gacaaagttg 
tccatttata 
gttttctttg 
tacagttaca 
gtgtacccct 
aatattttat 
agcccttaaa 
gttgcatgac 
aagaaggcca 
tctcatttat 
aaatcctttc 
ttgcaagcat 
catgagcctg 
tgttgaatga 
tgagtcttta 
gttttactat 
ctgaaattgc 
ttgtccttcg 
tgaagcctta 
gctgttcaac 
cacacagtaa 
gtgatcataa 
ggacttttga 
gaatttttta 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3284 



<210> 5 

<211> 1019 

<212> DNA 

<213> Homo sapiens 



<400> 5 

ggacaggcgt ggcggccgga gccccagcat ccctgcttga ggtccaggag cggagcccgc 60 

ggccaccgcc gcctgatcag cgcgaccccg gcccgcgccc gccccgcccg gcaagatgct 120 

gcccgtgtac caggaggtga agcccaaccc gctgcaggac gcgaacatct gctcacgcgt 180 

gttcttctgg tggctcaatc ccttgtttaa aattggccat aaacggagat tagaggaaga 240 

tgatatgtat tcagtgctgc cagaagaccg ctcacagcac cttggagagg agttgcaagg 300 

gttctgggat aaagaagttt taagagctga gaatgacgca cagaagcctt ctttaacaag 360 
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agcaatcata 
ggaaagtgcc 
ttatgatccc 
tttttgcacg 
tgggatgagg 
taacatggcc 
gaacaagttt 
gatcgcagtg 
agttctaatc 
gagtaaaact 
tataaggata 



aagtgttact 
aaagtaatcc 
atggattctg 
ctcattttgg 
ttacgagtag 
atggggaaga 
gatcaggtga 
actgccctac 
attctcctgc 
gcaactttca 
ataaaaatgt 



ggaaatctta 
agcccatatt 
tggctttgaa 
ctatactgca 
ccatgtgcca 
caaccacagg 
cagtgttctt 
tctggatgga 
ccttgcaaag 
cggatgccag 
acgcctgtgg 



tttagttttg 
tttgggaaaa 
cacagcgtac 
tcacttatat 
tatgatttat 
ccagatagtc 
acacttcctg 
gataggaata 
ctgttttggg 
gatcaggacc 
taagacgrga 



ggaattttta 
attattaatt 
gcctatgcca 
ttttatcacg 
cggaaggcac 
aatctgctgt 
tgggcaggac 
tcgtgccttg 
aagttgttct 
atgaatgaag 
tcctyrgagt 



cgttaattga 
attttgaaaa 
cggtgctgac 
ttcagtgtgc 
ttcgtcttag 
ccaatgatgt 
cactgcaggc 
ctgggatggc 
catcactgag 
ttataactgg 
cactcgcag 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1019 



<210> 6 

<211> 3200 

<212> DNA 

<213> Homo sapiens 



<400> 6 

ggcagggtcc cctcaggcta ttgggaacat ttatttttcc ttgaaacatg aagcctccta 60 

ggaggcctgc gtgtgctgtt tggtggatac gtgaccatgg agcttgggag gccgcaggtt 120 

tttctgcacc cagcctcact gagctgtttt ctctgtctgt ttccccagct cggagtccag 180 

tgttttccca cttgtcatct tctctccagg ggctctggac catccgggca tacaaagcag 240 

aagagaggtg tcaggaactg tttgatgcac accaggattt acattcagag gcttggttct 300 

tgtttttgac aacgtcccgc tggttcgccg tccgtctgga tgccatctgt gccatgtttg 360 

tcatcatcgt tgcctttggg tccctgattc tggcaaaaac tctggatgcc gggcaggttg 420 

gtttggcact gtcctatgcc ctcacgctca tggggatgtt tcagtggtgt gttcgacaaa 480 

gtgctgaagt tgagaatatg atgatctcag tagaaagggt cattgaatac acagaccttg 540 

aaaaagaagc accttgggaa tatcagaaac gcccaccacc agcctggccc catgaaggag 600 

tgataatctt tgacaatgtg aacttcatgt acagtccagg tgggcctctg gtactgaagc 660 

atctgacagc actcattaaa tcacaagaaa aggttggcat tgtgggaaga accggagctg 720 

gaaaaagttc cctcatctca gcccttttta gattgtcaga acccgaaggt aaaatttgga 780 

ttgataagat cttgacaact gaaattggac ttcacgattt aaggaagaaa atgtcaatca 840 

tacctcagga acctgttttg ttcactggaa caatgaggaa aaacctggat ccctttaatg 900 

agcacacgga tgaggaactg tggaatgcct tacaagaggt acaacttaaa gaaaccattg 960 

aagatcttcc tggtaaaatg gatactgaat tagcagaatc aggatccaat tttagtgttg 1020 

gacaaagaca actggtgtgc cttgccaggg caattctcag gaaaaatcag atattgatta 1080 

ttgatgaagc gacggcaaat gtggatccaa gaactgatga gttaatacaa aaaaaaatcc 1140 

gggagaaatt tgcccactgc accgtgctaa ccattgcaca cagattgaac accattattg 1200 

acagcgacaa gataatggtt ttagattcag gaagactgaa agaatatgat gagccgtatg 1260 

ttttgctgca aaataaagag agcctatttt acaagatggt gcaacaactg ggcaaggcag 1320 

aagccgctgc cctcactgaa acagcaaaac aggtatactt caaaagaaat tatccacata 1380 

ttggtcacac tgaccacatg gttacaaaca cttccaatgg acagccctcg accttaacta 1440 

ttttcgagac agcactgtga atccaaccaa aatgtcaagt ccgttccgaa ggcattttcc 1500 

actagttttt ggactatgta aaccacattg tacttttttt tactttggca acaaatattt 1560 

atacatacaa gatgctagtt catttgaata tttctcccaa cttatccaag gatctccagc 1620 

tctaacaaaa tggtttattt ttatttaaat gtcaatagtt gttttttaaa atccaaatca 1680 

gaggtgcagg ccaccagtta aatgccgtct atcaggtttt gtgccttaag agactacaga 1740 

gtcaaagctc atttttaaag gagtaggaca aagttgtcac aggtttttgt tgttgttttt 1800 

attgccccca aaattacatg ttaatttcca tttatatcag ggattctatt tacttgaaga 1860 

ctgtgaagtt gccattttgt ctcattgttt tctttgacat aactaggatc cattatttcc 1920 

cctgaaggct tcttgttaga aaatagtaca gttacaacca ataggaacaa caaaaagaaa 1980 

aagtttgtga cattgtagta gggagtgtgt accccttact ccccatcaaa aaaaaaaatg 2040 

gatacatggt taaaggatag aagggcaata ttttatcata tgttctaaaa gagaaggaag 2100 

a'gaaaatact actttctcaa aatggaagcc cttaaaggtg ctttgatact gaaggacaca 2160 

aatgtgaccg tccatcctcc tttagagttg catgacttgg acacggtaac tgttgcagtt 2220 

ttagactcag cattgtgaca cttcccaaga aggccaaacc tctaaccgac attcctgaaa 2280 

tacgtggcat tattcttttt tggatttctc atttatggaa ggctaaccct ctgttgaccg 2340 

taagcctttt ggtttgggct gtattgaaat cctttctaaa ttgcatgaat aggctctgct 2400 
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aacgtgatga 
tcaggatgca 
ctaatagcat 
ggaccttttg 
aatatgttga 
acctctcaga 
gtgtgcattt 
cagagagcaa 
actgccaaaa 
tggttaatgt 
tattttaggt 
tattcaaaat 
cctgtttctg 
aatgatagtt 



gacaaactga 
tccaggggtt 
tgtcatttgg 
tactttattt 
gttcagttta 
ataaggtgtc 
gtttgtatac 
cagatgtttt 
tacattgacc 
tcagtccaac 
gactgaaatt 
gtctatatgt 
ttttaatgtt 
ttcatctcca 



aaattattgc 
cattttcatg 
gctttctgtt 
gaactatgag 
ctaaaggttt 
acctccctga 
atatatttgt 
attgagtgaa 
gtagtagctg 
aaaccacaca 
gcaacagtga 
ttatttggac 
atcatagaat 



aagcattgac 
agcctgtcca 
gaatgaatca 
tctttaattt 
tactattatg 
aattgcatat 
ccttcgtata 
gccttaaaaa 
ttcaactcct 
cagtaaatgt 
tcataatgag 
ttttgaggtt 
tttttaatga 



tataattatg 
ggttagttta 
acaaaccaca 
ttcctgatga 
gtttgaagtg 
atgtatatag 
gcaagttttt 
gcacacacca 
agtacttaga 
ttattaatag 
gtttgttaaa 
aaagacagtc 
aactaaattc 



cagtacgttc 
ctcctgacca 
atacttcctg 
tggtggctgt 
gagtctcatg 
acatgcacac 
tgctcatcag 
cacacagcta 
aatacacgta 
tcatggttcg 
acgatagcta 
atataaacgt 
aattgaaata 



2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3200 



<210> 7 

<211> 633 

<212> DNA 

<213> Homo sapiens 



<400> 7 

actcaatgtc tgggctccaa accatgcata atatccacaa ctaaaatggc catgtcacca 60 

agagagcttc ctctatttct cagattactg gaagattcat gcccaggagt atccataatt 120 

tagcattcct gggatccgga cattctctct atcccaattt ttaatcatct gagtctgtat 180 

taactgcttg ccagcagtta atacagactc agcattgtga cacttcccaa gaaggccaaa 240 

cctctaaccg acattcctga aatacgtggc attattcttt tttggatttc tcatttatgg 300 

aaggctaacc ctctgttgac cgtaagcctt ttggtttggg ctgtattgaa atcctttcta 360 

aattgcatga ataggctctg ctaacgtgat gagacaaact gaaaattatt gcaagcattg 420 

actataatta tgcagtacgt tctcaggatg catccagggg ttcattttca tgagcctgtc 480 

caggttagtt tactcctgac cactaatagc attgtcattt gggctttctg ttgaatgaat 540 

caacaaacca caatacttcc tgggaccttt tgtactttat ttgaactatg agtctgtatt 600 

aactgcttgg ggatccttag agtcgagcca gca 633 



<210> 8 

<211> 377 

<212> DNA 

<213> Homo sapiens 



<400> 8 

tttgctgcta gtggttttga ggacctagcc ggctttattg gcttatattt gtatttagtt 60 

tgtatttaac ctctagatct ttgactactg tgttgtcttt gactacggtt tcaaaatgta 120 

tgtttttcaa tgtggagcac cccattcatg cattttttca ccctgtgcct tttacattta 180 

taggtgtatt tgtcaaattt tgcatgagaa gatcacaatt ttagtgactc atcagttgca 240 

gtacctcaaa gctgcaagtc agattctgat attgaaagat gtaagtagtt tctagaggtc 300 

tcccgaactt gcaagtactt gggtgtgttg aatccccttc ttcttcctcc atttcatgtt 360 

cctttcttct cagagct 377 



<210> 9 

<211> 2602 

<212> DNA 

<213> Homo sapiens 



<400> 9 

cctccacctc ctgggttcaa gtgattctcc 

aggcgcaccc caccctgcct ggctactttt 

tgttggccag gctggtcttg aacttctgac 



tgtctcagcc tcccgagtag ctggaattac 60 
tgtattttta gtagagatgg gatttcacca 120 
caccagtgaa ccacctgcct cggcctccca 180 
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aagtgctggg attacaggca tgagccacca cacccagcct ggtcacatgt ttaatatcag 240 

cttgtcttat gaacctgttt tgttcactgg aacaatgagg aaaaacctgg atccctttaa 300 

tgagcacacg gatgaggaac tgtggaatgc cttacaagag gtacaactta aagaaaccat 360 

tgaagatctt cctggtaaaa tggatactga attagcagaa tcaggatcca attttagtgt 420 

tggacaaaga caactggtgt gccttgccag ggcaattctc aggaaaaatc agatattgat 4 80 

tattgatgaa gcgacggcaa atgtggatcc aagaactgat gagttaatac aaaaaaaaat 540 

ccgggagaaa tttgcccact gcaccgtgct aaccattgca cacagattga acaccattat 600 

tgacagcgac aagataatgg ttttagattc aggaagactg aaagaatatg atgagccgta 660 

tgttttgctg caaaataaag agagcctatt ttacaagatg gtgcaacaac tgggcaaggc 720 

agaagccgct gccctcactg aaacagcaaa acaggtatac ttcaaaagaa attatccaca 780 

tattggtcac actgaccaca tggttacaaa cacttccaat ggacagccct cgaccttaac 840 

tattttcgag acagcactgt gaatccaacc aaaatgtcaa gtccgttccg aaggcatttt 900 

ccactagttt ttggactatg taaaccacat tgtacttttt tttactttgg caacaaatat 960 

ttatacatac aagatgctag ttcatttgaa tatttctccc aacttatcca aggatctcca 1020 

gctctaacaa aatggtttat ttttatttaa atgtcaatag ttgtttttta aaatccaaat 1080 

cagaggtgca ggccaccagt taaatgccgt ctatcaggtt ttgtgcctta agagactaca 1140 

gagtcaaagc tcatttttaa aggagtagga caaagttgtc acaggttttt gttgttgttt 1200 

ttattgcccc caaaattaca tgttaatttc catttatatc agggattcta tttacttgaa 1260 

gactgtgaag ttgccatttt gtctcattgt tttctttgac ataactagga tccattattt 1320 

cccctgaagg cttcttgtta gaaaatagta cagttacaac caataggaac aacaaaaaga 1380 

aaaagtttgt gacattgtag tagggagtgt gtacccctta ctccccatca aaaaaaaaaa 1440 

tggatacatg gttaaaggat agaagggcaa tattttatca tatgttctaa aagagaagga 1500 

agagaaaata ctactttctc aaaatggaag cccttaaagg tgctttgata ctgaaggaca 1560 

caaatgtgac cgtccatcct cctttagagt tgcatgactt ggacacggta actgttgcag 1620 

ttttagactc agcattgtga cacttcccaa gaaggccaaa cctctaaccg acattcctga 1680 

aatacgtggc attattcttt tttggatttc tcatttatgg aaggctaacc ctctgttgac 1740 

cgtaagcctt ttggtttggg ctgtattgaa atcctttcta aattgcatga ataggctctg 1800 

ctaacg.tgat gagacaaact gaaaattatt gcaagcattg actataatta tgcagtacgt 1860 

tctcaggatg catccagggg ttcattttca tgagcctgtc caggttagtt tactcctgac 1920 

cactaatagc attgtcattt gggctttctg ttgaatgaat caacaaacca caatacttcc 1980 

tgggaccttt tgtactttat ttgaactatg agtctttaat ttttcctgat gatggtggct 2040 

gtaatatgtt gagttcagtt tactaaaggt tttactatta tggtttgaag tggagtctca 2100 

tgacctctca gaataaggtg tcacctccct gaaattgcat atatgtatat agacatgcac 2160 

acgtgtgcat ttgtttgtat acatatattt gtccttcgta tagcaagttt tttgctcatc 2220 

agcagagagc aacagatgtt ttattgagtg aagccttaaa aagcacacac cacacacagc 2280 

taactgccaa aatacattga ccgtagtagc tgttcaactc ctagtactta gaaatacacg 2340 

tatggttaat gttcagtcca acaaaccaca cacagtaaat gtttattaat agtcatggtt 2400 

cgtattttag gtgactgaaa ttgcaacagt gatcataatg aggtttgtta aaacgatagc 2460 

tatattcaaa atgtctatat gtttatttgg acttttgagg ttaaagacag tcatataaac 2520 

gtcctgtttc tgttttaatg ttatcataga attttttaat gaaactaaat tcaattgaaa 2580 

taaatgatag ttttcatctc ca 2602 



<210> 10 

<211> 636 

<212> DNA 

<213> Homo sapiens 



<400> 10 

atggataagt 

takttgtatt 

agtgtaatcc 

acctgcakct 

gctgtggccg 

tttcttcggc 

gagtatggaa 

acagagaccc 

ttgaacgtat 

gcaacgtgta 



ttatactagt 
ccttgtatgr 
taacaacaac 
actcctgcta 
tgattccttg 
gatatttttt 
actcgggttg 
cctgaaattc 
tggccattat 
actttgcatt 



gttggcacat 
aaaagcgtcc 
tcakgmaagt 
tttcaagaca 
gatcgcaata 
ggaaacgtca 
gtatagacat 
ggcagactct 
gaatcattgt 
tggaataagg 



ggcggcatgt 
ctggtactac 
attttgaaaa 
ttgctacaag 
cccttggttc 
agagatgtga 
gctagctagt 
gtcttccaga 
gtcccttaga 
aaggagtgaa 



atagatatay 
aataagtctt 
gaatacttgg 
tggttggtgt 
cccttggaat 
agcgcctgga 
ttccatttat 
atttctctaa 
gcatgtggaa 
ggccatatgg 



yaggassacc 
tcgwgaaagg 
ataaggaaaa 
ggtctctgtg 
cattttcatt 
atctacaagt 
gccataaatt 
cattaggtaa 
ttgatagcct 
ggagtaatat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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tctacaggaa tgtcagcact gtgaagacag ggactc 



636 



<210> 11 

<211> 2420 

<212> DNA 

<213> Homo sapiens 



<400> 11 

aagatcttat 

atcataccga 

agcacacgga 

aagatcttcc 

gacaaagaca 

ttgatgaagc 

gggagaaatt 

acagcgacaa 

ttttgctgca 

aagccgctgc 

ttggtcacac 

ttttcgagac 

actagttttt 

atacatacaa 

tctaacaaaa 

gaggtgcagg 

gtcaaagctc 

attgccccca 

ctgtgaagtt 

cctgaaggct 

aagtttgtga 

gatacatggt 

agaaaatact 

aatgtgaccg 

ttagactcag 

tacgtggcat 

taagcctttt 

aacgtgatga 

tcaggatgca 

ctaatagcat 

ggaccttttg 

aatatgttga 

acctctcaga 

gtgtgcattt 

cagagagcaa 

actgccaaaa 

tggttaatgt 

tattttaggt 

tattcaaaat 

cctgtttctg 

aatgatagtt 



caatccaaat 

acctgttttg 

tgaggaactg 

tggtaaaatg 

actggtgtgc 

gacggcaaat 

tgcccactgc 

gataatggtt 

aaataaagag 

cctcactgaa 

tgaccacatg 

agcactgtga 

ggactatgta 

gatgctagtt 

tggtttattt 

ccaccagtta 

atttttaaag 

aaattacatg 

gccattttgt 

tcttgttaga 

cattgtagta 

taaaggatag 

actttctcaa 

tccatcctcc 

cattgtgaca 

tattcttttt 

ggtttgggct 

gacaaactga 

tccaggggtt 

tgtcatttgg 

tactttattt 

gttcagttta 

ataaggtgtc 

gtttgtatac 

cagatgtttt 

tacattgacc 

tcagtccaac 

gactgaaatt 

gtctatatgt 

ttttaatgtt 

ttcatctcca 



tttaccttcg 

ttcactggaa 

tggaatgcct 

gatactgaat 

cttgccaggg 

gtggatccaa 

accgtgctaa 

ttagattcag 

agcctatttt 

acagcaaaac 

gttacaaaca 

atccaaccaa 

aaccacattg 

catttgaata 

ttatttaaat 

aatgccgtct 

gagtaggaca 

ttaatttcca 

ctcattgttt 

aaatagtaca 

gggagtgtgt 

aagggcaata 

aatggaagcc 

tttagagttg 

cttcccaaga 

tggatttctc 

gtattgaaat 

aaattattgc 

cattttcatg 

gctttctgtt 

gaactatgag 

ctaaaggttt 

acctccctga 

atatatttgt 

attgagtgaa 

gtagtagctg 

aaaccacaca 

gcaacagtga 

ttatttggac 

atcatagaat 



ggttctgacg 
caatgaggaa 
tacaagaggt 
tagcagaatc 
caattctcag 
gaactgatga 
ccattgcaca 
gaagactgaa 
acaagatggt 
aggtatactt 
cttccaatgg 
aatgtcaagt 
tacttttttt 
tttctcccaa 
gtcaatagtt 
atcaggtttt 
aagttgtcac 
tttatatcag 
tctttgacat 
gttacaacca 
accccttact 
ttttatcata 
cttaaaggtg 
catgacttgg 
aggccaaacc 
atttatggaa 
cctttctaaa 
aagcattgac 
agcctgtcca 
gaatgaatca 
tctttaattt 
tactattatg 
aattgcatat 
ccttcgtata 
gccttaaaaa 
ttcaactcct 
cagtaaatgt 
tcataatgag 
ttttgaggtt 
tttttaatga 



atttaaggaa 
aaacctggat 
acaacttaaa 
aggatccaat 
gaaaaatcag 
gttaatacaa 
cagattgaac 
agaatatgat 
gcaacaactg 
caaaagaaat 
acagccctcg 
ccgttccgaa 
tactttggca 
cttatccaag 
gttttttaaa 
gtgccttaag 
aggtttttgt 
ggattctatt 
aactaggatc 
ataggaacaa 
ccccatcaaa 
tgttctaaaa 
ctttgatact 
acacggtaac 
tctaaccgac 
ggctaaccct 
ttgcatgaat 
tataattatg 
ggttagttta 
acaaaccaca 
ttcctgatga 
gtttgaagtg 
atgtatatag 
gcaagttttt 
gcacacacca 
agtacttaga 
ttattaatag 
gtttgttaaa 
aaagacagtc 
aactaaattc 



gaaaatgtca 

ccctttaatg 

gaaaccattg 

tttagtgttg 

atattgatta 

aaaaaaatcc 

accattattg 

gagccgtatg 

ggcaaggcag 

tatccacata 

accttaacta 

ggcattttcc 

acaaatattt 

gatctccagc 

atccaaatca 

agactacaga 

tgttgttttt 

tacttgaaga 

cattatttcc 

caaaaagaaa 

aaaaaaaatg 

gagaaggaag 

gaaggacaca 

tgttgcagtt 

attcctgaaa 

ctgttgaccg 

aggctctgct 

cagtacgttc 

ctcctgacca 

atacttcctg 

tggtggctgt. 

gagtctcatg 

acatgcacac 

tgctcatcag 

cacacagcta 

aatacacgta 

tcatggttcg 

acgatagcta 

atataaacgt 

aattgaaata 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2420 



<210> 12 

<211> 703 

<212> DNA 

<213> Homo sapiens 



<400> 12 

caagcagggc atgagttcac caccccagta acaacagtgc attacaatac ttaataatgg 



60 
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aacaaggaga gtagtatttt agtctcaaaa gcattggcac ttacaaccta ttgctttgtt 120 

tttagtggtt tccagctgtg gttgcattgc acgtagaaag tggaataatg taatgggctt* 180 

tgaaaccata ataatgaatg tctgaataat gacattattt cttgcgtttg taatactgtt 240 

aattaaatct atgtcgatcc tgttggaatt cataaaatca tctaaaaatt tttctaaata 300 

tacagtgttg ttttccccat tgtatcttga tctcaagcaa caaatggtaa aagtatagct 360 

attaatgtca ttaaatgtga attgtttcaa cattatgaag ggttcctctt ggtaagtggc 420 

agaaggagcc aggcttaggt ttgaagtgag actgacttta ttcccttctt cctccttacg 480 

tagctctgga tgccgggcag gttggtttgg cactgtccta tgccctcacg ctcatgggga 540 

tgtttcagtg gtgtgttcga caaagtgctg aagttgagaa tatgatgcct gtaatcccag 600 

ctactagaga ggctgaggca ggagaatcgc ttgaacctgg gaggcggagg ttgtggtgag 660 

ccgagatctc acaattgcac tccagcctgg gcaacaagag tga 703 



<210> 13 

<211> 1688 

<212> DNA 

<213> Homo sapiens 



<400> 13 

tcacagaaag 

ttctggaatc 

ttctgtagct 

ttaccgttct 

ttttaaagga 

attacatgtt 

cattttgtct 

ttgttagaaa 

ttgtagtagg 

aaggatagaa 

tttctcaaaa 

catcctcctt 

ttgtgacact 

ttcttttttg 

tttgggctgt 

caaactgaaa 

caggggttca 

tcatttgggc 

ctttatttga 

tcagtttact 

aaggtgtcac 

ttgtatacat 

gatgttttat 

cattgaccgt 

agtccaacaa 

ctgaaattgc 

ctatatgttt 

ttaatgttat 

catctcca 



ccacggtacc 
taaaggttcc 
caactgagca 
caaagtctat 
gtaggacaaa 
aatttccatt 
cattgttttc 
atagtacagt 
gagtgtgtac 
gggcaatatt 
tggaagccct 
tagagttgca 
tcccaagaag 
gatttctcat 
attgaaatcc 
attattgcaa 
ttttcatgag 
tttctgttga 
actatgagtc 
aaaggtttta 
ctccctgaaa 
atatttgtcc 
tgagtgaagc 
agtagctgtt 
accacacaca 
aacagtgatc 
atttggactt 
catagaattt 



attgtgtatg 
ttgggtgtgc 
aaacacactc 
caggttttgt 
gttgtcacag 
tatatcaggg 
tttgacataa 
tacaaccaat 
cccttactcc 
ttatcatatg 
taaaggtgct 
tgacttggac 
gccaaacctc 
ttatggaagg 
tttctaaatt 
gcattgacta 
cctgtccagg 
atgaatcaac 
tttaattttt 
ctattatggt 
ttgcatatat 
ttcgtatagc 
cttaaaaagc 
caactcctag 
gtaaatgttt 
ataatgaggt 
ttgaggttaa 
tttaatgaaa 



tgtatcgtat 
aactttggtg 
gcacattcat 
gccttaagag 
gtttttgttg 
attctattta 
ctaggatcca 
aggaacaaca 
ccatcaaaaa 
ttctaaaaga 
ttgatactga 
acggtaactg 
taaccgacat 
ctaaccctct 
gcatgaatag 
taattatgca 
ttagtttact 
aaaccacaat 
cctgatgatg 
ttgaagtgga 
gtatatagac 
aagttttttg 
acacaccaca 
tacttagaaa 
attaatagtc 
ttgttaaaac 
agacagtcat 
ctaaattcaa 



attgcctgta 
ccctactcca 
ctacttattc 
actacagagt 
ttgtttttat 
cttgaagact 
ttatttcccc 
aaaagaaaaa 
aaaaaatgga 
gaaggaagag 
aggacacaaa 
ttgcagtttt 
tcctgaaata 
gttgaccgta 
gctctgctaa 
gtacgttctc 
cctgaccact 
acttcctggg 
gtggctgtaa 
gtctcatgac 
atgcacacgt 
ctcatcagca 
cacagctaac 
tacacgtatg 
atggttcgta 
gatagctata 
ataaacgtcc 
ttgaaataaa 



tcacttagct 
cgtaaaaatc 
atcatgctta 
caaagctcat 
tgcccccaaa 
gtgaagttgc 
tgaaggcttc 
gtttgtgaca 
tacatggtta 
aaaatactac 
tgtgaccgtc 
agactcagca 
cgtggcatta 
agccttttgg 
cgtgatgaga 
aggatgcatc 
aatagcattg 
accttttgta 
tatgttgagt 
ctctcagaat 
gtgcatttgt 
gagagcaaca 
tgccaaaata 
gttaatgttc 
ttttaggtga 
ttcaaaatgt 
tgtttctgtt 
tgatagtttt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1688 



<210> 14 

<211> 5833 

<212> DNA 

<213> Homo sapiens 



<400> 14 

ggacaggcgt ggcggccgga gccccagcat ccctgcttga ggtccaggag cggagcccgc 

ggccaccgcc gcctgatcag cgcgaccccg gcccgcgccc gccccgcccg gcaagatgct 

gcccgtgtac caggaggtga agcccaaccc gctgcaggac gcgaacatct gctcacgcgt 



60 
120 
180 
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gttcttctgg tggctcaatc ccttgtttaa aattggccat aaacggagat tagaggaaga 240 

tgatatgtat tcagtgctgc cagaagaccg ctcacagcac cttggagagg agttgcaagg 300 

gttctgggat aaagaagttt taagagctga gaatgacgca cagaagcctt ctttaacaag 360 

agcaatcata aagtgttact ggaaatctta tttagttttg ggaattttta cgttaattga 420 

ggaaagtgcc aaagtaatcc agcccatatt tttgggaaaa attattaatt attttgaaaa 480 

ttatgatccc atggattctg tggctttgaa cacagcgtac gcctatgcca cggtgctgac 540 

tttttgcacg ctcattttgg ctatactgca tcacttatat ttttatcacg ttcagtgtgc 600 

tgggatgagg ttacgagtag ccatgtgcca tatgatttat cggaaggcac ttcgtcttag 660 

taacatggcc atggggaaga caaccacagg ccagatagtc aatctgctgt ccaatgatgt 720 

gaacaagttt gatcaggtga cagtgttctt acacttcctg tgggcaggac cactgcaggc 780 

gatcgcagtg actgccctac tctggatgga gataggaata tcgtgccttg ctgggatggc 840 

agttctaatc attctcctgc ccttgcaaag ctgttttggg aagttgttct catcactgag 900 

gagtaaaact gcaactttca cggatgccag gatcaggacc atgaatgaag ttataactgg 960 

tataaggata ataaaaatgt acgcctggga aaagtcattt tcaaatctta ttaccaattt 1020 

gagaaagaag gagatttcca agattctgag aagttcctgc ctcaggggga tgaatttggc 1080 

ttcgtttttc agtgcaagca aaatcatcgt gtttgtgacc ttcaccacct acgtgctcct 1140 

cggcagtgtg atcacagcca gccgcgtgtt cgtggcagtg acgctgtatg gggctgtgcg 1200 

gctgacggtt accctcttct tcccctcagc cattgagagg gtgtcagagg" caatcgtcag 1260 

catccgaaga atccagacct ttttgctact tgatgagata tcacagcgca accgtcagct 1320 

gccgtcagat ggtaaaaaga tggtgcatgt gcaggatttt actgcttttt gggataaggc 1380 

atcagagacc ccaactctac aaggcctttc ctttactgtc agacctggcg aattgttagc 1440 

tgtggtcggc cccgtgggag cagggaagtc atcactgtta agtgccgtgc tcggggaatt 1500 

ggccccaagt cacgggctgg tcagcgtgca tggaagaatt gcctatgtgt ctcagcagcc 1560 

ctgggtgttc tcgggaactc tgaggagtaa tattttattt gggaagaaat atgaaaagga 1620 

acgatatgaa aaagtcataa aggcttgtgc tctgaaaaag gatttacagc tgttggagga 1680 

tggtgatctg actgtgatag gagatcgggg aaccacgctg agtggagggc agaaagcacg 1740 

ggtaaacctt gcaagagcag tgtatcaaga tgctgacatc tatctcctgg acgatcctct 1800 

cagtgcagta gatgcggaag ttagcagaca cttgttcgaa ctgtgtattt gtcaaatttt 1860 

gcatgagaag atcacaattt tagtgactca tcagttgcag tacctcaaag ctgcaagtca 1920 

gattctgata ttgaaagatg gtaaaatggt gcagaagggg acttacactg agttcctaaa 1980 

atctggtata gattttggct cccttttaaa gaaggataat gaggaaagtg aacaacctcc 2040 

agttccagga actcccacac taaggaatcg taccttctca gagtcttcgg tttggtctca 2100 

acaatcttct agaccctcct tgaaagatgg tgctctggag agccaagata cagagaatgt 2160 

cccagttaca ctatcagagg agaaccgttc tgaaggaaaa gttggttttc aggcctataa 2220 

gaattacttc agagctggtg ctcactggat tgtcttcatt ttccttattc tcctaaacac 2280 

tgcagctcag gttgcctatg tgcttcaaga ttggtggctt tcatactggg caaacaaaca 2340 

aagtatgcta aatgtcactg taaatggagg aggaaatgta accgagaagc tagatcttaa 2400 

ctggtactta ggaatttatt caggtttaac tgtagctacc gttctttttg gcatagcaag 2460 

atctctattg gtattctacg tccttgttaa ctcttcacaa actttgcaca acaaaatgtt 2520 

tgagtcaatt ctgaaagctc cggtattatt ctttgataga aatccaatag gaagaatttt 2580 

aaatcgtttc tccaaagaca ttggacactt ggatgatttg ctgccgctga cgtttttaga 2640 

tttcatccag acattgctac aagtggttgg tgtggtctct gtggctgtgg ccgtgattcc 2700 

ttggatcgca atacccttgg ttccccttgg aatcattttc atttttcttc ggcgatattt 2760 

tttggaaacg tcaagagatg tgaagcgcct ggaatctaca actcggagtc cagtgttttc 2820 

ccacttgtca tcttctctcc aggggctctg gaccatccgg gcatacaaag cagaagagag 2880 

gtgtcaggaa ctgtttgatg cacaccagga tttacattca gaggcttggt tcttgttttt 2940 

gacaacgtcc cgctggttcg ccgtccgtct ggatgccatc tgtgccatgt ttgtcatcat 3000 

cgttgccttt gggtccctga ttctggcaaa aactctggat gccgggcagg ttggtttggc 3060 

actgtcctat gccctcacgc tcatggggat gtttcagtgg tgtgttcgac aaagtgctga 3120 

agttgagaat atgatgatct cagtagaaag ggtcattgaa tacacagacc ttgaaaaaga 3180 

agcaccttgg gaatatcaga aacgcccacc accagcctgg ccccatgaag gagtgataat 3240 

ctttgacaat gtgaacttca tgtacagtcc aggtgggcct ctggtactga agcatctgac 3300 

agcactcatt aaatcacaag aaaaggttgg cattgtggga agaaccggag ctggaaaaag 3360 

ttccctcatc tcagcccttt ttagattgtc agaacccgaa ggtaaaattt ggattgataa 3420 

gatcttgaca actgaaattg gacttcacga tttaaggaag aaaatgtcaa tcatacctca 3480 

ggaacctgtt ttgttcactg gaacaatgag gaaaaacctg gatcccttta atgagcacac 3540 

ggatgaggaa ctgtggaatg ccttacaaga ggtacaactt aaagaaacca ttgaagatct 3600 

tcctggtaaa atggatactg aattagcaga atcaggatcc aattttagtg ttggacaaag 3660 

acaactggtg tgccttgcca gggcaattct caggaaaaat cagatattga ttattgatga 3720 
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agcgacggca 
atttgcccac 
caagataatg 
gcaaaataaa 
tgccctcact 
cactgaccac 
gacagcactg 
tttggactat 
caagatgcta 
aaatggttta 
aggccaccag 
ctcattttta 
ccaaaattac 
gttgccattt 
gcttcttgtt 
tgacattgta 
ggttaaagga 
actactttct 
ccgtccatcc 
cagcattgtg 
cattattctt 
tttggtttgg 
tgagacaaac 
gcatccaggg 
cattgtcatt 
ttgtacttta 
tgagttcagt 
agaataaggt 
tttgtttgta 
caacagatgt 
aaatacattg 
tgttcagtcc 
ggtgactgaa 
aatgtctata 
ctgttttaat 
gttttcatct 



aatgtggatc 
tgcaccgtgc 
gttttagatt 
gagagcctat 
gaaacagcaa 
atggttacaa 
tgaatccaac 
gtaaaccaca 
gttcatttga 
tttttattta 
ttaaatgccg 
aaggagtagg 
atgttaattt 
tgtctcattg 
agaaaatagt 
gtagggagtg 
tagaagggca 
caaaatggaa 
tcctttagag 
acacttccca 
ttttggattt 
gctgtattga 
tgaaaattat 
gttcattttc 
tgggctttct 
tttgaactat 
ttactaaagg 
gtcacctccc 
tacatatatt 
tttattgagt 
accgtagtag 
aacaaaccac 
attgcaacag 
tgtttatttg 
gttatcatag 
cca 



caagaactga 
taaccattgc 
caggaagact 
tttacaagat 
aacaggtata 
acacttccaa 
caaaatgtca 
ttgtactttt 
atatttctcc 
aatgtcaata 
tctatcaggt 
acaaagttgt 
ccatttatat 
ttttctttga 
acagttacaa 
tgtacccctt 
atattttatc 
gcccttaaag 
ttgcatgact 
agaaggccaa 
ctcatttatg 
aatcctttct 
tgcaagcatt 
atgagcctgt 
gttgaatgaa 
gagtctttaa 
ttttactatt 
tgaaattgca 
tgtccttcgt 
gaagccttaa 
ctgttcaact 
acacagtaaa 
tgatcataat 
gacttttgag 
aattttttaa 



tgagttaata 
acacagattg 
gaaagaatat 
ggtgcaacaa 
cttcaaaaga 
tggacagccc 
agtccgttcc 
ttttactttg 
caacttatcc 
gttgtttttt 
tttgtgcctt 
cacaggtttt 
cagggattct 
cataactagg 
ccaataggaa 
actccccatc 
atatgttcta 
gtgctttgat 
tggacacggt 
acctctaacc 
gaaggctaac 
aaattgcatg 
gactataatt 
ccaggttagt 
tcaacaaacc 
tttttcctga 
atggtttgaa 
tatatgtata 
atagcaagtt 
aaagcacaca 
cctagtactt 
tgtttattaa 
gaggtttgtt 
gttaaagaca 
tgaaactaaa 



caaaaaaaaa 
aacaccatta 
gatgagccgt 
ctgggcaagg 
aattatccac 
tcgaccttaa 
gaaggcattt 
gcaacaaata 
aaggatctcc 
aaaatccaaa 
aagagactac 
tgttgttgtt 
atttacttga 
atccattatt 
caacaaaaag 
aaaaaaaaaa 
aaagagaagg 
actgaaggac 
aactgttgca 
gacattcctg 
cctctgttga 
aataggctct 
atgcagtacg 
ttactcctga 
acaatacttc 
tgatggtggc 
gtggagtctc 
tagacatgca 
ttttgctcat 
ccacacacag 
agaaatacac 
tagtcatggt 
aaaacgatag 
gtcatataaa 
ttcaattgaa 



tccgggagaa 
ttgacagcga 
atgttttgct 
cagaagccgc 
atattggtca 
ctattttcga 
tccactagtt 
tttatacata 
agctctaaca 
tcagaggtgc 
agagtcaaag 
tttattgccc 
agactgtgaa 
tcccctgaag 
aaaaagtttg 
atggatacat 
aagagaaaat 
acaaatgtga 
gttttagact 
aaatacgtgg 
ccgtaagcct 
gctaacgtga 
ttctcaggat 
ccactaatag 
ctgggacctt 
tgtaatatgt 
atgacctctc 
cacgtgtgca 
cagcagagag 
ctaactgcca 
gtatggttaa 
tcgtatttta 
ctatattcaa 
cgtcctgttt 
ataaatgata 



3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5833 



<210> 15 

<211> 1182 

<212> DNA 

<213> Homo sapiens 



<400> 15 

actcaatgtc tgggctccaa accatgcata atatccacaa ctaaaatggc catgtcacca 60 

agagagcttc ctctatttct cagattactg gaagattcat gcccaggagt atccataatt 120 

tagcattcct gggatccgga cattctctct atcccaattt ttaatcatct gagtctgtat 180 

taactgcttg ccagcagtta atacagactc agcattgtga cacttcccaa gaaggccaaa 240 

cctctaaccg acattcctga aatacgtggc attattcttt tttggatttc tcatttatgg 300 

aaggctaacc ctctgttgac cgtaagcctt ttggtttggg ctgtattgaa atcctttcta 360 

aattgcatga ataggctctg ctaacgtgat gagacaaact gaaaattatt gcaagcattg 420 

actataatta tgcagtacgt tctcaggatg catccagggg ttcattttca tgagcctgtc 480 

caggttagtt tactcctgac cactaatagc attgtcattt gggctttctg ttgaatgaat 540 

caacaaacca caatacttcc tgggaccttt tgtactttat ttgaactatg agtctttaat 600 

ttttcctgat gatggtggct gtaatatgtt gagttcagtt tactaaaggt tttactatta 660 

tggtttgaag tggagtctca tgacctctca gaataaggtg tcacctccct gaaattgcat 720 

atatgtatat agacatgcac acgtgtgcat ttgtttgtat acatatattt gtccttcgta 780 

tagcaagttt tttgctcatc agcagagagc aacagatgtt ttattgagtg aagccttaaa 840 

aagcacacac cacacacagc taactgccaa aatacattga ccgtagtagc tgttcaactc 900 



13 



WO 03/104404 



PCT/US03/17772 



ctagtactta gaaatacacg tatggttaat gttcagtcca acaaaccaca cacagtaaat 960 

gtttattaat agtcatggtt cgtattttag gtgactgaaa ttgcaacagt gatcataatg 1020 

aggtttgtta aaacgatagc tatattcaaa atgtctatat gtttatttgg acttttgagg 1080 

ttaaagacag tcatataaac gtcctgtttc tgttttaatg ttatcataga attttttaat 1140 

gaaactaaat tcaattgaaa taaatgatag ttttcatctc ca 1182 



420 
480 
540 
600 



<210> 16 

<211> 5862 

<212> DNA 

<213> Homo sapiens 

<400> 16 

ggacaggcgt ggcggccgga gccccagcat ccctgcttga ggtccaggag cggagcccgc 60 

ggccaccgcc gcctgatcag cgcgaccccg gcccgcgccc gccccgcccg gcaagatgct 120 

gcccgtgtac caggaggtga agcccaaccc gctgcaggac gcgaacatct gctcacgcgt 180 

gttcttctgg tggctcaatc ccttgtttaa aattggccat aaacggagat tagaggaaga 240 

tgatatgtat tcagtgctgc cagaagaccg ctcacagcac cttggagagg agttgcaagg 300 

gttctgggat aaagaagttt taagagctga gaatgacgca cagaagcctt ctttaacaag 360 
agcaatcata aagtgttact ggaaatctta tttagttttg ggaattttta cgttaattga 
ggaaagtgcc aaagtaatcc agcccatatt tttgggaaaa attattaatt attttgaaaa 
ttatgatccc atggattctg tggctttgaa cacagcgtac gcctatgcca cggtgctgac 
tttttgcacg ctcattttgg ctatactgca tcacttatat ttttatcacg ttcagtgtgc 

tgggatgagg ttacgagtag ccatgtgcca tatgatttat cggaaggcac ttcgtcttag 660 

taacatggcc atggggaaga caaccacagg ccagatagtc aatctgctgt ccaatgatgt 720 

gaacaagttt gatcaggtga cagtgttctt acacttcctg tgggcaggac cactgcaggc 780 

gatcgcagtg actgccctac tctggatgga gataggaata tcgtgccttg ctgggatggc 840 

agttctaatc attctcctgc ccttgcaaag ctgttttggg aagttgttct catcactgag 900 

gagtaaaact gcaactttca cggatgccag gatcaggacc atgaatgaag ttataactgg 960 

tataaggata ataaaaatgt acgcctggga aaagtcattt tcaaatctta ttaccaattt 1020 

gagaaagaag gagatttcca agattctgag aagttcctgc ctcaggggga tgaatttggc 1080 

ttcgtttttc agtgcaagca aaatcatcgt gtttgtgacc ttcaccacct acgtgctcct 1140 

cggcagtgtg atcacagcca gccgcgtgtt cgtggcagtg acgctgtatg gggctgtgcg 1200 

gctgacggtt accctcttct tcccctcagc cattgagagg gtgtcagagg caatcgtcag 1260 

catccgaaga atccagacct ttttgctact tgatgagata tcacagcgca accgtcagct 1320 

gccgtcagat ggtaaaaaga tggtgcatgt gcaggatttt actgcttttt gggataaggc 1380 

atcagagacc ccaactctac aaggcctttc ctttactgtc agacctggcg aattgttagc 1440 

tgtggtcggc cccgtgggag cagggaagtc atcactgtta agtgccgtgc tcggggaatt 1500 

ggccccaagt cacgggctgg tcagcgtgca tggaagaatt gcctatgtgt ctcagcagcc 1560 

ctgggtgttc tcgggaactc tgaggagtaa tattttattt gggaagaaat atgaaaagga. 1620 

acgatatgaa aaagtcataa aggcttgtgc tctgaaaaag gatttacagc tgttggagga 1680 

tggtgatctg actgtgatag gagatcgggg aaccacgctg agtggagggc agaaagcacg 1740 
ggtaaacctt gcaagagcag tgtatcaaga tgctgacatc tatctcctgg acgatcctct 
cagtgcagta gatgcggaag ttagcagaca cttgttcgaa ctgtgtattt gtcaaatttt 

gcatgagaag atcacaattt tagtgactca tcagttgcag tacctcaaag ctgcaagtca 1920 

gattctgata ttgaaagatg gtaaaatggt gcagaagggg acttacactg agttcctaaa 1980 

atctggtata gattttggct cccttttaaa gaaggataat gaggaaagtg aacaacctcc 2040 

agttccagga actcccacac taaggaatcg taccttctca gagtcttcgg tttggtctca 2100 

acaatcttct agaccctcct tgaaagatgg tgctctggag agccaagata cagagaatgt 2160 

cccagttaca ctatcagagg agaaccgttc tgaaggaaaa gttggttttc aggcctataa 2220 

gaattacttc agagctggtg ctcactggat tgtcttcatt ttccttattc tcctaaacac 2280 

tacaqctcaq qttgcctatg tgcttcaaga ttggtggctt tcatactggg caaacaaaca 2340 

24 00 
2460 
2520 



1800 
1860 



aagtatgcta aatgtcactg taaatggagg aggaaatgta accgagaagc tagatcttaa 

ctggtactta ggaatttatt caggtttaac tgtagctacc gttctttttg gcatagcaag 

atctctattg gtattctacg tccttgttaa ctcttcacaa actttgcaca acaaaatgtt 

tgagtcaatt ctgaaagctc cggtattatt ctttgataga aatccaatag gaagaatttt 2580 

aaatcgtttc tccaaagaca ttggacactt ggatgatttg ctgccgctga cgtttttaga 2640 

tttcatccag acattgctac aagtggttgg tgtggtctct gtggctgtgg ccgtgattcc 2700 

ttggatcgca atacccttgg ttccccttgg aatcattttc atttttcttc ggcgatattt 2760 
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tttggaaacg tcaagagatg tgaagcgcct ggaatctaca actcggagtc cagtgttttc 2820 

ccacttgtca tcttctctcc aggggctctg gaccatccgg gcatacaaag cagaagagag 2880 

gtgtcaggaa ctgtttgatg cacaccagga tttacattca gaggcttggt tcttgttttt 2940 

gacaacgtcc cgctggttcg ccgtccgtct ggatgccatc tgtgccatgt ttgtcatcat 3000 

cgttgccttt gggtccctga ttctggcaaa aactctggat gccgggcagg ttggtttggc 3060 

actgtcctat gccctcacgc tcatggggat gtttcagtgg tgtgttcgac aaagtgctga 3120 

agttgagaat atgatgatct cagtagaaag ggtcattgaa tacacagacc ttgaaaaaga 3180 

agcaccttgg gaatatcaga aacgcccacc accagcctgg ccccatgaag gagtgataat 3240 

ctttgacaat gtgaacttca tgtacagtcc aggtgggcct ctggtactga agcatctgac 3300 

agcactcatt aaatcacaag aaaaggttgg cattgtggga agaaccggag ctggaaaaag 3360 

ttccctcatc tcagcccttt ttagattgtc agaacccgaa ggtaaaattt ggattgataa 3420 

gatcttgaca actgaaattg gacttcacga tttaaggaag aaaatgtcaa tcatacctca 3480 

ggaacctgtt ttgttcactg gaacaatgag gaaaaacctg gatcccttta atgagcacac 3540 

ggatgaggaa ctgtggaatg ccttacaaga ggtacaactt aaagaaacca ttgaagatct 3600 

tcctggtaaa atggatactg aattagcaga atcaggatcc aattttagtg ttggacaaag 3660 

acaactggtg tgccttgcca gggcaattct caggaaaaat cagatattga ttattgatga 3720 

agcgacggca aatgtggatc caagaactga tgagttaata caaaaaaaaa tccgggagaa 3780 

atttgcccac tgcaccgtgc taaccattgc acacagattg aacaccatta ttgacagcga 3840 

caagataatg gttttagatt caggaagact gaaagaatat gatgagccgt atgttttgct 3900 

gcaaaataaa gagagcctat tttacaagat ggtgcaacaa ctgggcaagg cagaagccgc 3960 

tgccctcact gaaacagcaa aacaggtata cttcaaaaga aattatccac atattggtca 4020 

cactgaccac atggttacaa acacttccaa tggacagccc tcgaccttaa ctattttcga 4080 

gacagcactg tgaatccaac caaaatgtca agtccgttcc gaaggcattt tccactagtt 4140 

tttggactat gtaaaccaca ttgtactttt ttttactttg gcaacaaata tttatacata 4200 

caagatgcta gttcatttga atatttctcc caacttatcc aaggatctcc agctctaaca 4260 

aaatggttta tttttattta aatgtcaata gttgtttttt aaaatccaaa tcagaggtgc 4320 

aggccaccag ttaaatgccg tctatcaggt tttgtgcctt aagagactac agagtcaaag 4380 

ctcattttta aaggagtagg acaaagttgt cacaggtttt tgttgttgtt tttattgccc 4440 

ccaaaattac atgttaattt ccatttatat cagggattct atttacttga agactgtgaa 4500 

gttgccattt tgtctcattg ttttctttga cataactagg atccattatt tcccctgaag 4560 

gcttcttgtt agaaaatagt acagttacaa ccaataggaa caacaaaaag aaaaagtttg 4620 

tgacattgta gtagggagtg tgtacccctt actccccatc aaaaaaaaaa atggatacat 4 680 

ggttaaagga tagaagggca atattttatc atatgttcta aaagagaagg aagagaaaat 4740 

actactttct caaaatggaa gcccttaaag gtgctttgat actgaaggac acaaatgtga 4800 

ccgtccatcc tcctttagag ttgcatgact tggacacggt aactgttgca gttttagact 4860 

cagcattgtg acacttccca agaaggccaa acctctaacc gacattcctg aaatacgtgg 4920 

cattattctt ttttggattt ctcatttatg gaaggctaac cctctgttga ccgtaagcct 4 980 

tttggtttgg gctgtattga aatcctttct aaattgcatg aataggctct gctaacgtga 5040 

tgagacaaac tgaaaattat tgcaagcatt gactataatt atgcagtacg ttctcaggat 5100 

gcatccaggg gttcattttc atgagcctgt ccaggttagt ttactcctga ccactaatag 5160 

cattgtcatt tgggctttct gttgaatgaa tcaacaaacc acaatacttc ctgggacctt 5220 

ttgtacttta tttgaactat gagtctttaa tttttcctga tgatggtggc tgtaatatgt 5280 

tgagttcagt ttactaaagg ttttactatt atggtttgaa gtggagtctc atgacctctc 5340 

agaataaggt gtcacctccc tgaaattgca tatatgtata tagacatgca cacgtgtgca 5400 

tttgtttgta tacatatatt tgtccttcgt atagcaagtt ttttgctcat cagcagagag 5460 

caacagatgt tttattgagt gaagccttaa aaagcacaca ccacacacag ctaactgcca 5520 

aaatacattg accgtagtag ctgttcaact cctagtactt agaaatacac gtatggttaa 5580 

tgttcagtcc aacaaaccac acacagtaaa tgtttattaa tagtcatggt tcgtatttta 5640 

ggtgactgaa attgcaacag tgatcataat gaggtttgtt aaaacgatag ctatattcaa 5700 

aatgtctata tgtttatttg gacttttgag gttacatgca gtgacattca ggtcgcgttt 5760 

cggttgtcat gagatcagag ggtttttgat ccagacccac ccccgcccca ccccgcgcgc 5820 

ctttgctgcc tcttccagag acccagcttg cagagcggcc gc 5862 

<210> 17 

<211> 1118 

<212> DNA 

<213> Homo sapiens 
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<400> 17 

taaactagtc caggttccca cagctcgtaa gctgctgtct cttgtctccc aggtctcagc 60 

tcagctgtgt ctttgggaca tctttccaga ccattcactc tgtaggagcc acctaccgct 120 

gagtctgtag cagttaatac agactcagca ttgtgacact tcccaagaag gccaaacctc 180 

taaccgacat tcctgaaata cgtggcatta ttcttttttg gatttctcat ttatggaagg 240 

ctaaccctct gttgaccgta agccttttgg tttgggctgt attgaaatcc tttctaaatt 300 

gcatgaatag gctctgctaa cgtgatgaga caaactgaaa attattgcaa gcattgacta 360 

taattatgca gtacgttctc aggatgcatc caggggttca ttttcatgag cctgtccagg 420 

ttagtttact cctgaccact aatagcattg tcatttgggc tttctgttga atgaatcaac 480 

aaaccacaat acttcctggg accttttgta ctttatttga actatgagtc tttaattttt 540 

cctgatgatg gtggctgtaa tatgttgagt tcagtttact aaaggtttta ctattatggt 600 

ttgaagtgga gtctcatgac ctctcagaat aaggtgtcac ctccctgaaa ttgcatatat 660 

gtatatagac atgcacacgt gtgcatttgt ttgtatacat atatttgtcc ttcgtatagc 720 

aagttttttg ctcatcagca gagagcaaca gatgttttat tgagtgaagc cttaaaaagc 780 

acacaccaca cacagctaac tgccaaaata cattgaccgt agtagctgtt caactcctag 840 

tacttagaaa tacacgtatg gttaatgttc agtccaacaa accacacaca gtaaatgttt 900 

attaatagtc atggttcgta ttttaggtga ctgaaattgc aacagtgatc ataatgaggt 960 

ttgttaaaac gatagctata ttcaaaatgt ctatatgttt atttggactt ttgaggttaa 1020 

agacagtcat ataaa cc tgtttctgtt ttaatgttat catagaattt tttaatgaaa 1080 

ctaaattcaa ttgaaataaa tgatagtttt catctcca 1118 



<210> 
<211> 
<212> 



18 

1531 
DNA 



<213> Homo sapiens 



<220> 
<221> 
<222> 
<223> 



niisc_f eature 
(1) . . (1531) 
n=a,t,g or c 



<400> 18 

caagcagggc 

aacaaggaga 

tttagtggtt 

tgaaaccata 

aattaaatct 

tacagtgttg 

attaatgtca 

agaaggagcc 

tagctctgga 

tgtttcagtg 

gggtcattga 

caccagcctg 

caggtgggcc 

gcattgtggg 

cagaacccga 

atttaaggaa 

ggaaaaacct 

aggtacaact 

aatcaggatc 

tcaggaaaaa 

atgagttaat 

cacacagatt 

tgaaagaata 

ggtgggtggc 

ngccaaaccg 

gaccccttgg 



atgagttcac 
gtagtatttt 
tccagctgtg 
ataatgaatg 
atgtcgatcc 
ttttccccat 
ttaaatgtga 
aggcttaggt 
tgccgggcag 
gtgtgttcga 
atacacagac 
gccccatgaa 
tctggtactg 
aagaaccgga 
aggtaaaatt 
gaaaatgtca 
ggatcccttt 
taaagaaacc 
caattttagt 
tcagatattg 
acaaaaaaaa 
gaacaccatt 
tgatgagccg 
caaccacttg 
ggttttactt 
ggttnccaac 



caccccagta 
agtctcaaaa 
gttgcattgc 
tctgaataat 
tgttggaatt 
tgtatcttga 
attgtttcaa 
ttgaagtgag 
gttggtttgg 
caaagtgctg 
cttgaaaaag 
ggagtgataa 
aagcatctga 
gctggaaaaa 
tggattgata 
atcatacctc 
aatgagcaca 
attgaagatc 
gttggacaaa 
attattgatg 
atccgggaga 
attgacagcg 
tatgttttgc 
gggccagggc 
tcccaaaggg 
cctttccctt 



acaacagtgc 
gcattggcac 
acgtagaaag 
gacattattt 
cataaaatca 
tctcaagcaa 
cattatgaag 
actgacttta 
cactgtccta 
aagttgagaa 
aagcaccttg 
tctttgacaa 
cagcactcat 
gttccctcat 
agatcttgac 
aggaacctgt 
cggatgagga 
ttcctggtaa 
gacaactggt 
aagcgacggc 
aatttgccca 
acaagataat 
tgcaaaataa 
nggaggccgc 
atttttnccc 
t 



attacaatac 
ttacaaccta 
tggaataatg 
cttgcgtttg 
tctaaaaatt 
caaatggtaa 
ggttcctctt 
ttcccttctt 
tgccctcacg 
tatgatgatc 
ggaatatcag 
tgtgaacttc 
taaatcacaa 
ctcagccctt 
aactgaaatt 
tttgttcact 
actgtggaat 
aatggatact 
gtgccttgcc 
aaatgtggat 
ctgcaccgtg 
ggttttagat 
agagagccta 
ttgcccttca 
cctttttggg 



ttaataatgg 
ttgctttgtt 
taatgggctt 
taatactgtt 
tttctaaata 
aagtatagct 
ggtaagtggc 
cctccttacg 
ctcatgggga 
tcagtagaaa 
aaacgcccac 
atgtacagtc 
gaaaaggttg 
tttagattgt 
ggacttcacg 
ggaacaatga 
gccttacaag 
gaattagcag 
agggcaattc 
ccaagaactg 
ctaaccattg 
tcaggaagac 
ttttttacaa 
cttggaaacc 
ttcaccantg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1531 
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<210> 19 

<211> 286 

<212> PRT 

<213> Homo sapiens 

<400> 19 

Leu His Leu Leu Gly Ser Ser Asp Ser Pro Val Ser Ala Ser Arg Val 
15 10 15 

Ala Gly lie Thr Gly Ala Pro His Pro Ala Trp Leu Leu Phe Val Phe 
20 25 30 

Leu Val Glu Met Gly Phe His His Val Gly Gin Ala Gly Leu Glu Leu 
35 40 45 

Leu Thr Thr Ser Glu Pro Pro Ala Ser Ala Ser Gin Ser Ala Gly lie 
50 55 60 

Thr Gly Met Ser His His Thr Gin Pro Gly His Met Phe Asn lie Ser 
65 70 75 80 

Leu Ser Tyr Glu Pro Val Leu Phe Thr Gly Thr Met Arg Lys Asn Leu 
85 90 " 95 

Asp Pro Phe Asn Glu His Thr Asp Glu Glu Leu Trp Asn Ala Leu Gin 
100 105 110 

Glu Val Gin Leu Lys Glu Thr lie Glu Asp Leu Pro Gly Lys Met Asp 
115 120 125 

Thr Glu Leu Ala Glu Ser Gly Ser Asn Phe Ser Val Gly Gin Arg Gin 
130 135 140 

Leu Val Cys Leu Ala Arg Ala lie Leu Arg Lys Asn Gin lie Leu lie 
145 150 155 160 

lie Asp Glu Ala Thr Ala Asn Val Asp Pro Arg Thr Asp Glu Leu lie 
165 170 175 

Gin Lys Lys lie Arg Glu Lys Phe Ala His Cys Thr Val Leu Thr lie 
180 185 190 

Ala His Arg Leu Asn Thr lie lie Asp Ser Asp Lys lie Met Val Leu 
195 200 205 

Asp Ser Gly Arg Leu Lys Glu Tyr Asp Glu Pro Tyr Val Leu Leu Gin 
210 215 220 

Asn Lys Glu Ser Leu Phe Tyr Lys Met Val Gin Gin Leu Gly Lys Ala 
225 230 235 240 

Glu Ala Ala Ala Leu Thr Glu Thr Ala Lys Gin Val Tyr Phe Lys Arg 



245 



250 



255 



Asn Tyr Pro His 
2 60 



He Gly His Thr Asp 
265 



His Met Val Thr Asn Thr 
270 



Ser 



Asn Gly Gin Pro 
275 



Ser Thr Leu Thr He 
280 



Phe Glu Thr Ala Leu 
285 
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<210> 20 

<211> 225 

<212> PRT 

<213> Homo sapiens 

<400> 20 

Asp Leu He Asn Pro Asn Phe Thr Phe Gly Phe Trp Arg Phe Lys Glu 
15 10 15 

Glu Asn Val Asn His Thr Glu Pro Val Leu Phe Thr Gly Thr Met Arg 
20 25 30 

Lys Asn Leu Asp Pro Phe Asn Glu His Thr Asp Glu Glu Leu Trp Asn 
35 40 45 

Ala Leu Gin Glu Val Gin Leu Lys Glu Thr He Glu Asp Leu Pro Gly 
50 55 60 

Lys Met Asp Thr Glu Leu Ala Glu Ser Gly Ser Asn Phe Ser Val Gly 
65 L 70 75 80 

Gin Arg Gin Leu Val Cys Leu Ala Arg Ala He Leu Arg Lys Asn Gin 
85 90 95 

He Leu He He Asp Glu Ala Thr Ala Asn Val Asp Pro Arg Thr Asp 
100 105 110 

Glu Leu He Gin Lys Lys He Arg Glu Lys Phe Ala His Cys Thr Val 
115 120 125 

Leu Thr He Ala His Arg Leu Asn Thr He He Asp Ser Asp Lys He 
130 135 140 

Met Val Leu Asp Ser Gly Arg Leu Lys Glu Tyr Asp Glu Pro Tyr Val 
145 150 155 160 

Leu Leu Gin Asn Lys Glu Ser Leu Phe Tyr Lys Met Val Gin Gin Leu 
165 170 175 

Gly Lys Ala Glu Ala Ala Ala Leu Thr Glu Thr Ala Lys Gin Val Tyr 
180 185 190 

Phe Lys Arg Asn Tyr Pro His He Gly His Thr Asp His Met Val Thr 
195 200 205 

Asn Thr Ser Asn Gly Gin Pro Ser Thr Leu Thr He Phe Glu Thr Ala 
210 215 220 

Leu 
225 
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<211> 1325 

<212> PRT 
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Met Leu Pro Val Tyr Gin Glu Val Lys Pro Asn Pro Leu Gin Asp Ala 
15 10 15 

Asn lie Cys Ser Arg Val Phe Phe Trp Trp Leu Asn Pro Leu Phe Lys 
20 25 30 

lie Gly His Lys Arg Arg Leu Glu Glu Asp Asp Met Tyr Ser Val Leu 
35 40 45 

Pro Glu Asp Arg Ser Gin His Leu Gly Glu Glu Leu Gin Gly Phe Trp 
50 55 60 

Asp Lys Glu Val Leu Arg Ala Glu Asn Asp Ala Gin Lys Pro Ser Leu 
65 7 0 7 5 " 80 

Thr Arg Ala lie lie Lys Cys Tyr Trp Lys Ser Tyr Leu Val Leu Gly 
85 90 95 

lie Phe Thr Leu He Glu Glu Ser Ala Lys Val He Gin Pro He Phe 
100 105 110 

Leu Gly Lys He He Asn Tyr Phe Glu Asn Tyr Asp Pro Met Asp Ser 
115 120 125 

Val Ala Leu Asn Thr Ala Tyr Ala Tyr Ala Thr Val Leu Thr Phe Cys 
130 135 140 

Thr Leu He Leu Ala He Leu His His Leu Tyr Phe Tyr His Val Gin 
145 150 155 160 

Cys Ala Gly Met Arg Leu Arg Val Ala Met Cys His Met He Tyr Arg 
165 170 175 

Lys Ala Leu Arg Leu Ser Asn Met Ala Met Gly Lys Thr Thr Thr Gly 
180 185 190 

Gin He Val Asn Leu Leu Ser Asn Asp Val Asn Lys Phe Asp Gin Val 
195 200 205 

Thr Val Phe Leu His Phe Leu Trp Ala Gly Pro Leu Gin Ala He Ala 
210 215 220 

Val Thr Ala Leu Leu Trp Met Glu He Gly He Ser Cys Leu Ala Gly 
225 230 235 240 

Met Ala Val Leu He He Leu Leu Pro Leu Gin Ser Cys Phe Gly Lys 
245 250 255 

Leu Phe Ser Ser Leu Arg Ser Lys Thr Ala Thr Phe Thr Asp Ala Arg 
260 265 270 

He Arg Thr Met Asn Glu Val He Thr Gly He Arg He He Lys Met 
275 280 285 

Tyr Ala Trp Glu Lys Ser Phe Ser Asn Leu lie Thr Asn Leu Arg Lys 
290 295 300 

Lys Glu He Ser Lys He Leu Arg Ser Ser Cys Leu Arg Gly Met Asn 
305 310 315 320 
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Leu Ala Ser Phe Phe Ser Ala Ser Lys He He Val Phe Val Thr Phe 
325 330 335 

Thr Thr Tyr Val Leu Leu Gly Ser Val He Thr Ala Ser Arg Val Phe. 
340 345 350 

Val Ala- Val Thr Leu Tyr Gly Ala Val Arg Leu Thr Val Thr Leu Phe 
355 360 365 

Phe Pro Ser Ala He Glu Arg Val Ser Glu Ala He Val Ser He Arg 
370 375 380 

Arg He Gin Thr Phe Leu Leu Leu Asp Glu He Ser Gin Arg Asn Arg 
385 390 395 400 

Gin Leu Pro Ser Asp Gly Lys Lys Met Val His Val Gin Asp Phe Thr 
405 410 415 

Ala Phe Trp Asp Lys Ala Ser Glu Thr Pro Thr Leu Gin Gly Leu Ser 
420 425 430 

Phe Thr Val Arg Pro Gly Glu Leu Leu Ala Val Val Gly Pro Val Gly 
435 440 445 



Ala Gly Lys Ser Ser Leu Leu Ser Ala Val Leu Gly Glu Leu Ala Pro 
450 455 460 

Ser His Gly Leu Val Ser Val His Gly Arg He Ala Tyr Val Ser Gin 
465 ' 470 475 480 

Gin Pro Trp Val Phe Ser Gly Thr Leu Arg Ser Asn He Leu Phe Gly 
485 490 495 

Lys Lys Tyr Glu Lys Glu Arg Tyr Glu Lys Val He Lys Ala Cys Ala 
500 505 510 

Leu Lys Lys Asp Leu Gin Leu Leu Glu Asp Gly Asp Leu Thr Val He 
515 ' 520 525 

Gly Asp Arg Gly Thr Thr Leu Ser Gly Gly Gin Lys Ala Arg Val Asn 
530 535 540 

Leu Ala Arg Ala Val Tyr Gin Asp Ala Asp He Tyr Leu Leu Asp Asp 
545 J 550 555 560 

Pro Leu Ser Ala Val Asp Ala Glu Val Ser Arg His Leu Phe Glu Leu 
565 570 575 

Cys He Cys Gin He Leu His Glu Lys He Thr He Leu Val Thr His 
580 585 590 

Gin Leu Gin Tyr Leu Lys Ala Ala Ser Gin He Leu He Leu Lys Asp 
595 600 605 

Gly Lys Met Val Gin Lys Gly Thr Tyr Thr Glu Phe Leu Lys Ser Gly 
610 615 620 

He Asp Phe Gly Ser Leu Leu Lys Lys Asp Asn Glu Glu Ser Glu Gin 
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625 



630 



635 



640 



Pro Pro Val Pro Gly Thr Pro Thr Leu Arg Asn Arg Thr Phe Ser Glu 
645 650 655 

Ser Ser Val Trp Ser Gin Gin Ser Ser Arg Pro Ser Leu Lys Asp Gly 
660 665 670 

Ala Leu Glu Ser Gin Asp Thr Glu Asn Val Pro Val Thr Leu Ser Glu 
675 680 685 

Glu Asn Arg Ser Glu Gly Lys Val Gly Phe Gin Ala Tyr Lys Asn Tyr 
690 695 700 

Phe Arg Ala Gly Ala His Trp lie Val Phe lie Phe Leu lie Leu Leu 
705 710 715 720 

Asn Thr Ala Ala Gin Val Ala Tyr Val Leu Gin Asp Trp Trp Leu Ser 
725 730 735 

Tyr Trp Ala Asn Lys Gin Ser Met Leu Asn Val Thr Val Asn Gly Gly 
740 745 750 

Gly Asn Val Thr Glu Lys Leu Asp Leu Asn Trp Tyr Leu Gly lie Tyr 
7 55 7 60 ^ 7 65 

Ser Gly Leu Thr Val Ala Thr Val Leu Phe Gly lie Ala Arg Ser Leu 
770 775 780 

Leu Val Phe Tyr Val Leu Val Asn Ser Ser Gin Thr Leu His Asn Lys 
785 790 795 800 

Met Phe Glu Ser lie Leu Lys Ala Pro Val Leu Phe Phe Asp Arg Asn 
805 810 815 

Pro lie Gly Arg lie Leu Asn Arg Phe Ser Lys Asp lie Gly His Leu 
820 825 830 

Asp Asp Leu Leu Pro Leu Thr Phe Leu Asp Phe lie Gin Thr Leu Leu 
835 840 845 

Gin Val Val Gly Val Val Ser Val Ala Val Ala Val He Pro Trp He 
850 855 860 

Ala He Pro Leu Val Pro Leu Gly He He Phe He Phe Leu Arg Arg 
865 870 875 880 

Tyr Phe Leu Glu Thr Ser Arg Asp Val Lys Arg Leu Glu Ser Thr Thr 
885 890 895 

Arg Ser Pro Val Phe Ser His Leu Ser Ser Ser Leu Gin Gly Leu Trp 
900 905 910 

Thr He Arg Ala Tyr Lys Ala Glu Glu Arg Cys Gin Glu Leu Phe Asp 
915 920 925 



Ala His Gin Asp Leu His Ser Glu Ala Trp Phe Leu Phe Leu Thr Thr 
930 935 940 



21 



WO 03/104404 



PCT/US03/17772 



Ser Arg Trp Phe Ala Val Arg Leu Asp Ala lie Cys Ala Met Phe Val 
945 " * 950 955 960 

He He Val Ala Phe Gly Ser Leu He Leu Ala Lys Thr Leu Asp Ala 
965 970 975 

Gly Gin Val Gly Leu Ala Leu Ser Tyr Ala Leu Thr Leu Met Gly Met 
980 985 990 

Phe Gin Trp Cys Val Arg Gin Ser Ala Glu Val Glu Asn Met Met He 
995 1000 1005 

Ser Val Glu Arg Val He Glu Tyr Thr Asp Leu Glu Lys Glu Ala 
1010 1015 1020 

Pro Trp Glu Tyr Gin Lys Arg Pro Pro Pro Ala Trp Pro His Glu 
1025 " 1030 1035 

Gly Val He He Phe Asp Asn Val Asn Phe Met Tyr Ser Pro Gly 
1040 1045 1050 

Gly Pro Leu Val Leu Lys His Leu Thr Ala Leu He Lys Ser Gin 
1055 1060 1065 

Glu Lys Val Gly He Val Gly Arg Thr Gly Ala Gly Lys Ser Ser 
1070 1075 1080 

Leu He Ser Ala Leu Phe Arg Leu Ser Glu Pro Glu Gly Lys He 
1085 1090 1095 

Trp He Asp Lys He Leu Thr Thr Glu He Gly Leu His Asp Leu 
1100 ^ H05 1110 

Arg Lys Lys Met Ser He He Pro Gin Glu Pro Val Leu Phe Thr 
1115 1120 1125 

Gly Thr Met Arg Lys Asn Leu Asp Pro Phe Asn Glu His Thr Asp 
1130 ' H35 1140 

Glu Glu Leu Trp Asn Ala Leu Gin Glu Val Gin Leu Lys Glu Thr 
1145 1150 1155 

He Glu Asp Leu Pro Gly Lys Met Asp Thr Glu Leu Ala Glu Ser 
1160 H65 1170 

Gly Ser Asn Phe Ser Val Gly Gin Arg Gin Leu Val Cys Leu Ala 
1175 H80 1185 

Arg Ala He Leu Arg Lys Asn Gin He Leu He He Asp Glu Ala 
1190 ~ H95 1200 

Thr Ala Asn Val Asp Pro Arg Thr Asp Glu Leu He Gin Lys Lys 
1205 1210 1215 

He Arg Glu Lys Phe Ala His Cys Thr Val Leu Thr He Ala His 
1220 1225 1230 

Arg Leu Asn Thr He He Asp Ser Asp Lys He Met Val Leu Asp 
1235 1240 1245 
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Ser Gly Arg Leu Lys Glu Tyr Asp Glu Pro Tyr Val Leu Leu Gin 
1250 1255 1260 

Asn Lys Glu Ser Leu Phe Tyr Lys Met Val Gin Gin Leu Gly Lys 
1265 1270 1275 

Ala Glu Ala Ala Ala Leu Thr Glu Thr Ala Lys Gin Val Tyr Phe 
1280 1285 1290 

Lys Arg Asn Tyr Pro His He Gly His Thr Asp His Met Val Thr 
1295 "* 1300 1305 

Asn Thr Ser Asn Gly Gin Pro Ser Thr Leu Thr He Phe Glu Thr 
1310 1315 1320 

Ala Leu 
1325 



<210> 22 

<211> 332 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> (1)..(332) 

<223> X = any amino acid 

<400> 22 

Met Gly Met Phe Gin Trp Cys Val Arg Gin Ser Ala Glu Val Glu Asn 
15 10 15 

Met Met He Ser Val Glu Arg Val He Glu Tyr Thr Asp Leu Glu Lys 
20 25 30 

Glu Ala Pro Trp Glu Tyr Gin Lys Arg Pro Pro Pro Ala Trp Pro His 
35 40 45 

Glu Gly Val lie He Phe Asp Asn Val Asn Phe Met Tyr Ser Pro Gly 
50 55 60 

Gly Pro Leu Val Leu Lys His Leu Thr Ala Leu He Lys Ser Gin Glu 
65 70 75 80 

Lys Val Gly He Val Gly Arg Thr Gly Ala Gly Lys Ser Ser Leu He 
85 90 95 

Ser Ala Leu Phe Arg Leu Ser Glu Pro Glu Gly Lys lie Trp He Asp 
100 105 110 

Lys He Leu Thr Thr Glu He Gly Leu His Asp Leu Arg Lys Lys Met 
115 120 125 

Ser He He Pro Gin Glu Pro Val Leu Phe Thr Gly Thr Met Arg Lys 
130 135 140 
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Asn Leu Asp Pro Phe Asn Glu His Thr Asp Glu Glu Leu Trp Asn Ala 
145 150 155 160 

Leu Gin Glu Val Gin Leu Lys Glu Thr lie Glu Asp Leu Pro Gly Lys 
165 170 175 

Met Asp Thr Glu Leu Ala Glu Ser Gly Ser Asn Phe Ser Val Gly Gin 
180 185 190 

Arg Gin Leu Val Cys Leu Ala Arg Ala lie Leu Arg Lys Asn Gin lie 
195 200 205 

Leu He He Asp Glu Ala Thr Ala Asn Val Asp Pro Arg Thr Asp Glu 
210 215 220 

Leu He Gin Lys Lys He Arg Glu Lys Phe Ala His Cys Thr Val Leu 
225 230 235 240 

Thr He Ala His Arg Leu Asn Thr He He Asp Ser Asp Lys He Met 
245 250 255 

Val Leu Asp Ser Gly Arg Leu Lys Glu Tyr Asp Glu Pro Tyr Val Leu 
260 265 270 

Leu Gin Asn Lys Glu Ser Leu Phe Phe Thr Arg Trp Val Ala Asn His 
275 280 285 

Leu Gly Pro Gly Xaa Glu Ala Ala Cys Pro Ser Leu Gly Asn Xaa Pro 
290 295 300 

Asn Arg Val Leu Leu Ser Gin Arg Asp Phe Xaa Pro Leu Phe Gly Phe 
305 310 315 320 

Thr Xaa Gly Pro Leu Gly Val Xaa Asn Pro Phe Pro 

325 " 330 



<210> 23 

<211> 954 

<212> PRT 

<213> Homo sapiens 

<400> 23 

Met Leu Pro Val Tyr Gin Glu Val Lys Pro Asn Pro Leu Gin Asp Ala 
15 10 15 

Asn lie Cys Ser Arg Val Phe Phe Trp Trp Leu Asn Pro Leu Phe Lys 
20 25 30 

He Gly His Lys Arg Arg Leu Glu Glu Asp Asp Met Tyr Ser Val Leu 
35 40 45 

Pro Glu Asp Arg Ser Gin His Leu Gly Glu Glu Leu Gin Gly Phe Trp 
. 50 * ~ 55 60 

Asp Lys Glu Val Leu Arg Ala Glu Asn Asp Ala Gin Lys Pro Ser Leu 
65 70 75 80 

Thr Arg Ala He He Lys Cys Tyr Trp Lys Ser Tyr Leu Val Leu Gly 
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85 90 95 

He Phe Thr Leu He Glu Glu Ser Ala Lys Val He Gin Pro He Phe 
100 105 110 

Leu Gly Lys lie lie Asn Tyr Phe Glu Asn Tyr Asp Pro Met Asp Ser 
115 120 125 

Val Ala Leu Asn Thr Ala Tyr Ala Tyr Ala Thr Val Leu Thr Phe Cys 
130 135 140 

Thr Leu lie Leu Ala He Leu His His Leu Tyr Phe Tyr His Val Gin 
145 150 155 160 

Cys Ala Gly Met Arg Leu Arg Val Ala Met Cys His Met He Tyr Arg 
165 170 175 

Lys Ala Leu Arg Leu Ser Asn Met Ala Met Gly Lys Thr Thr Thr Gly 
180 185 190 

Gin He Val Asn Leu Leu Ser Asn Asp Val Asn Lys Phe Asp Gin Val 
195 200 205 

Thr Val Phe Leu His Phe Leu Trp Ala Gly Pro Leu Gin Ala lie Ala 
210 215 * 220 

Val Thr Ala Leu Leu Trp Met Glu lie Gly lie Ser Cys Leu Ala Gly 
225 230 235 ~ 240 

Met Ala Val Leu He lie Leu Leu Pro Leu Gin Ser Cys Phe Gly Lys 
245 250 ' 255 

Leu Phe Ser Ser Leu Arg Ser Lys Thr Ala Thr Phe Thr Asp Ala Arg 
260 265 270 

lie Arg Thr Met Asn Glu Val lie Thr Gly lie Arg lie lie Lys Met 
275 280 285 

Tyr Ala Trp Glu Lys Ser Phe Ser Asn Leu lie Thr Asn Leu Arg Lys 
290 295 300 

Lys Glu lie Ser Lys He Leu Arg Ser Ser Cys Leu Arg Gly Met Asn 
305 310 315 320 

Leu Ala Ser Phe Phe Ser Ala Ser Lys He lie Val Phe Val Thr Phe 
325 330 335 

Thr Thr Tyr Val Leu Leu Gly Ser Val lie Thr Ala Ser Arg Val Phe 
340 345 350 

Val Ala Val Thr Leu Tyr Gly Ala Val Arg Leu Thr Val Thr Leu Phe 
355 360 365 

Phe Pro Ser Ala He Glu Arg Val Ser Glu Ala lie Val Ser lie Arg 
370 375 380 

Arg lie Gin Thr Phe Leu Leu Leu Asp Glu He Ser Gin Arg Asn Arg 
385 390 395 400 
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Gin Leu Pro Ser Asp Gly Lys Lys Met Val His Val Gin Asp Phe Thr 
405 410 415 

Ala Phe Trp Asp Lys Ala Ser Glu Thr Pro Thr Leu Gin Gly Leu Ser 
420 425 430 

Phe Thr Val Arg Pro Gly Glu Leu Leu Ala Val Val Gly Pro Val Gly 
435 440 445 

Ala Gly Lys Ser Ser Leu Leu Ser Ala Val Leu Gly Glu Leu Ala Pro 
450 455 460 

Ser His Gly Leu Val Ser Val His Gly Arg He Ala Tyr Val Ser Gin 
465 470 475 480 

Gin Pro Trp Val Phe Ser Gly Thr Leu Arg Ser Asn He Leu Phe Gly 
4 85 4 90 4 95 

Lys Lys Tyr Glu Lys Glu Arg Tyr Glu Lys Val He Lys Ala Cys Ala 
500 505 510 

Leu Lys Lys Asp Leu Gin Leu Leu Glu Asp Gly Asp Leu Thr Val He 
515 520 525 

Gly Asp Arg Gly Thr Thr Leu Ser Gly Gly Gin Lys Ala Arg Val Asn 
. 530 535 540 

Leu Ala Arg Ala Val Tyr Gin Asp Ala Asp He Tyr Leu Leu Asp Asp 
545 550 555 560 

Pro Leu Ser Ala Val Asp Ala Glu Val Ser Arg His Leu Phe Glu Leu 
565 570 575 

Cys He Cys Gin He Leu His Glu Lys He Thr He Leu Val Thr His 
580 585 590 

Gin Leu Gin Tyr Leu Lys Ala Ala Ser Gin He Leu He Leu Lys Asp 
595 600 605 

Gly Lys Met Val Gin Lys Gly Thr Tyr Thr Glu Phe Leu Lys Ser Gly 
610 615 620 

He Asp Phe Gly Ser Leu Leu Lys Lys Asp Asn Glu Glu Ser Glu Gin 
625 630 635 640 

Pro Pro Val Pro Gly Thr Pro Thr Leu Arg Asn Arg Thr Phe Ser Glu 
645 650 655 

Ser Ser Val Trp Ser Gin Gin Ser Ser Arg Pro Ser Leu Lys Asp Gly 
660 665 670 

Ala Leu Glu Ser Gin Asp Thr Glu Asn Val Pro Val Thr Leu Ser Glu 
675 680 685 

Glu Asn Arg Ser Glu Gly Lys Val Gly Phe Gin Ala Tyr Lys Asn Tyr 
690 695 700 

Phe Arg Ala Gly Ala His Trp He Val Phe lie Phe Leu lie Leu Leu 
705 7 1° 715 720 
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Asn Thr Ala Ala Gin Val Ala Tyr Val Leu Gin Asp Trp Trp Leu Ser 
725 730 735 

Tyr Trp Ala Asn Lys Gin Ser Met Leu Asn Val Thr Val Asn Gly Gly 
740 745 750 

Gly Asn Val Thr Glu Lys Leu Asp Leu Asn Trp Tyr Leu Gly He Tyr 
755 760 765 

Ser Gly Leu Thr Val Ala Thr Val Leu Phe Gly He Ala Arg Ser Leu 
770 775 780 

Leu Val Phe Tyr Val Leu Val Asn Ser Ser Gin Thr Leu His Asn Lys 
785 790 795 800 

Met Phe Glu Ser He Leu Lys Ala Pro Val Leu Phe Phe Asp Arg Asn 
805 810 815 

Pro He Gly Arg He Leu Asn Arg Phe Ser Lys Asp He Gly His Leu 
820 825 4 830 

Asp Asp Leu Leu Pro Leu Thr Phe Leu Asp Phe He Gin Thr Leu Leu 
835 840 845 

Gin Val Val Gly Val Val Ser Val Ala Val Ala Val He Pro Trp lie 
850 855 860 

Ala He Pro Leu Val Pro Leu Gly He lie Phe He Phe Leu Arg Arg 
865 870 875 880 

Tyr Phe Leu Glu Thr Ser Arg Asp Val Lys Arg Leu Glu Ser Thr Thr 
885 890 895 

Arg Ser Pro Val Phe Ser His Leu Ser Ser Ser Leu Gin Gly Leu Trp 
900 905 910 

Thr He Arg Ala Tyr Lys Ala Glu Glu Arg Cys Gin Glu Leu Phe Asp 
915 920 925 

Ala His Gin Asp Leu His Ser Gly Leu Ser He Ser Gly Asn Gly Phe 
930 935 940 

Lys Gly Gin Asp Leu Leu Leu Phe Leu Ala 

950 



945 




<210> 


24 


<211> 


301 


<212> 


PRT 


<213> 


Homo 


<400> 


24 



Met Leu Pro Val Tyr Gin Glu Val Lys Pro Asn Pro Leu Gin Asp Ala 

15 10 15 

Asn lie Cys Ser Arg Val Phe Phe Trp Trp Leu Asn Pro Leu Phe Lys 
20 25 30 
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He Gly His Lys Arg Arg Leu Glu Glu Asp Asp Met Tyr Ser Val Leu 
35 40 45 

Pro Glu Asp Arg Ser Gin His Leu Gly Glu Glu Leu Gin Gly Phe Trp 
50 1 " 55 60 

Asp Lys Glu Val Leu Arg Ala Glu Asn Asp Ala Gin Lys Pro Ser Leu 
65 70 75 80 

Thr Arg Ala He He Lys Cys Tyr Trp Lys Ser Tyr Leu Val Leu Gly 
85 90 95 

He Phe Thr Leu He Glu Glu Ser Ala Lys Val He Gin Pro He Phe 
100 105 110 

Leu Gly Lys He He Asn Tyr Phe Glu Asn Tyr Asp Pro Met Asp Ser 
115 120 125 

Val Ala Leu Asn Thr Ala Tyr Ala Tyr Ala Thr Val Leu Thr Phe Cys 
130 135 140 

Thr Leu He Leu Ala He Leu His His Leu Tyr Phe Tyr His Val Gin 
145 150 155 160 

Cys Ala Gly Met Arg Leu Arg Val Ala Met Cys His Met He Tyr Arg 
165 170 175 

Lys Ala Leu Arg Leu Ser Asn Met Ala Met Gly Lys Thr Thr Thr Gly 
180 185 190 

Gin lie Val Asn Leu Leu Ser Asn Asp Val Asn Lys Phe Asp Gin Val 
195 200 205 

Thr Val Phe Leu His Phe Leu Trp Ala Gly Pro Leu Gin Ala He Ala 
210 215 220 

Val Thr Ala Leu Leu Trp Met Glu He Gly He Ser Cys Leu Ala Gly 
225 230 235 240 

Met Ala Val Leu He He Leu Leu Pro Leu Gin Ser Cys Phe Gly Lys 
245 250 255 

Leu Phe Ser Ser Leu Arg Ser Lys Thr Ala Thr Phe Thr Asp Ala Arg 
260 265 270 

He Arg Thr Met Asn Glu Val He Thr Gly He Arg lie lie Lys Met 
275 280 285 

Tyr Ala Cys Gly Lys Thr Gly Ser Ser Glu Ser Leu Ala 
290 295 300 



<210> 25 

<211> 336 

<212> PRT 

<213> Homo sapiens 

<400> 25 

Met Gly Met Phe Gin Trp Cys Val Arg Gin Ser Ala Glu Val Glu Asn 
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10 



15 



Met Met He Ser Val Glu Arg Val He Glu Tyr Thr Asp Leu Glu Lys 
20 25 30 

Glu Ala Pro Trp Glu Tyr Gin Lys Arg Pro Pro Pro Ala Trp Pro His 
35 40 45 

Glu Gly Val He He Phe Asp Asn Val Asn Phe Met Tyr Ser Pro Gly 
50 55 60 

Gly Pro Leu Val Leu Lys His Leu Thr Ala Leu He Lys Ser Gin Glu 
65 70 75 " 80 

Lys Val Gly He Val Gly Arg Thr Gly Ala Gly Lys Ser Ser Leu He 
8 5 90 95 

Ser Ala Leu Phe Arg Leu Ser Glu Pro Glu Gly Lys He Trp He Asp 
100 105 110 

Lys He Leu Thr Thr Glu He Gly Leu His Asp Leu Arg Lys Lys Met 
115 120 125 

Ser He He Pro Gin Glu Pro Val Leu Phe Thr Gly Thr Met Arg Lys 
130 135 140 

Asn Leu Asp Pro Phe Asn Glu His Thr Asp Glu Glu Leu Trp Asn Ala 
145 150 155 160 

Leu Gin Glu Val Gin Leu Lys Glu Thr lie Glu Asp Leu Pro Gly Lys 
165 170 175 

Met Asp Thr Glu Leu Ala Glu Ser Gly Ser Asn Phe Ser Val Gly Gin 
180 185 190 

Arg Gin Leu Val Cys Leu Ala Arg Ala lie Leu Arg Lys Asn Gin lie 
195 200 205 

Leu He He Asp Glu Ala Thr Ala Asn Val Asp Pro Arg Thr Asp Glu 
210 215 220 

Leu lie Gin Lys Lys lie Arg Glu Lys Phe Ala His Cys Thr Val Leu 
225 230 235 240 

Thr He Ala His Arg Leu Asn Thr He lie Asp Ser Asp Lys He Met 
245 250 255 

Val Leu Asp Ser Gly Arg Leu Lys Glu Tyr Asp Glu Pro Tyr Val Leu 
260 265 270 

Leu Gin Asn Lys Glu Ser Leu Phe Tyr Lys Met Val Gin Gin Leu Gly 
275 280 285 

Lys Ala Glu Ala Ala Ala Leu Thr Glu Thr Ala Lys Gin Val Tyr Phe 
290 295 300 



Lys Arg Asn Tyr Pro His He Gly His Thr Asp His Met Val Thr Asn 
305 310 315 320 
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Thr Ser Asn Gly Gin Pro Ser Thr Leu Thr He Phe Glu Thr Ala Leu 
325 330 335 



<210> 26 

<211> 470 

<212> PRT 

<213> Homo sapiens 

<400> 26 

Met Lys Pro Pro Arg Arg Pro Ala Cys Ala Val Trp Trp He Arg Asp 
15 10 ,15 

His Gly Ala Trp Glu Ala Ala Gly Phe Ser Ala Pro Ser Leu Thr Glu 
20 25 30 

Leu Phe Ser Leu Ser Val Ser Pro Ala Arg Ser Pro Val Phe Ser His 
35 40 45 

Leu Ser Ser Ser Leu Gin Gly Leu Trp Thr He Arg Ala Tyr Lys Ala 
50 55 60 

Glu Glu Arg Cys Gin Glu Leu Phe Asp Ala His Gin Asp Leu His Ser 
65 ~ * 70 75 80 

Glu Ala Trp Phe Leu Phe Leu Thr Thr Ser Arg Trp Phe Ala Val Arg 
85 90 95 

Leu Asp Ala He Cys Ala Met Phe Val He He Val Ala Phe Gly Ser 
100 105 110 

Leu He Leu Ala Lys Thr Leu Asp Ala Gly Gin Val Gly Leu Ala Leu 
115 120 125 

Ser Tyr Ala Leu Thr Leu Met Gly Met Phe Gin Trp Cys Val Arg Gin 
130 135 140 

Ser Ala Glu Val Glu Asn Met Met He Ser Val Glu Arg Val He Glu 
145 150 155 160 

Tyr Thr Asp Leu Glu Lys Glu Ala Pro Trp Glu Tyr Gin Lys Arg Pro 
165 170 175 

Pro Pro Ala Trp Pro His Glu Gly Val He He Phe Asp Asn Val Asn 
180 185 190 

Phe Met Tyr Ser Pro Gly Gly Pro Leu Val Leu Lys His Leu Thr Ala 
195 200 . 205 

Leu He Lys Ser Gin Glu Lys Val Gly He Val Gly Arg Thr Gly Ala 
210 215 220 

Gly Lys Ser Ser Leu He Ser Ala Leu Phe Arg Leu Ser Glu Pro Glu 
225 230 235 240 

Gly Lys He Trp He Asp Lys He Leu Thr Thr Glu He Gly Leu His 
245 250 255 

Asp Leu Arg Lys Lys Met Ser He He Pro Gin Glu Pro Val Leu Phe 
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260 265 270 

Thr Gly Thr Met Arg Lys Asn Leu Asp Pro Phe Asn Glu His Thr Asp 
275 280 285 

Glu Glu Leu Trp Asn Ala Leu Gin Glu Val Gin Leu Lys Glu Thr lie 
290 295 300 

Glu Asp Leu Pro Gly Lys Met Asp Thr Glu Leu Ala Glu Ser Gly Ser 
305 310 315 320 

Asn Phe Ser Val Gly Gin Arg Gin Leu Val Cys Leu Ala Arg Ala lie 
325 330 335 

Leu Arg Lys Asn Gin lie Leu lie lie Asp Glu Ala Thr Ala Asn Val 
340 345 ~ 350 

Asp Pro Arg Thr Asp Glu Leu lie Gin Lys Lys lie Arg Glu Lys Phe 
355 360 365 

Ala His Cys Thr Val Leu Thr lie Ala His Arg Leu Asn Thr lie lie 
370 375 380 

Asp Ser Asp Lys lie Met Val Leu Asp Ser Gly Arg Leu Lys Glu Tyr 
385 390 395 400 

Asp Glu Pro Tyr Val Leu Leu Gin Asn Lys Glu Ser Leu Phe Tyr Lys 
405 410 415 

Met Val Gin Gin Leu Gly Lys Ala Glu Ala Ala Ala Leu Thr Glu Thr 
420 425 430 

Ala Lys Gin Val Tyr Phe Lys Arg Asn Tyr Pro His lie Gly His Thr 
435 440 445 

Asp His Met Val Thr Asn Thr Ser Asn Gly Gin Pro Ser Thr Leu Thr 
450 455 460 

He Phe Glu Thr Ala Leu 
465 470 
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